AI + ML

This article is more than 1 year old

Time to start taking machine-learning security seriously, Microsoft boffin insists

Forget academic adversarial attacks and imagine if ML were the weakest link in the chain

Wed 3 Feb 2021 // 07:05 UTC

Enigma When Microsoft surveyed 28 organizations last year about how they viewed machine learning (ML) security, its researchers found that few firms gave the matter much thought.

"As a result, our collective security posture is close to zero," said Hyrum Anderson, principal architect in the Azure trustworthy machine learning group at Microsoft, during a presentation at USENIX's Enigma 2021 virtual conference.

Pointing to Microsoft's survey [PDF], Anderson said almost 90 per cent of organizations – 25 out of 28 – didn't know how to secure their ML systems.

The issue for many of these companies is that commonly cited attacks on ML systems – like an adversarial attack that makes an ML image recognition model classify a tabby cat as guacamole – are considered too speculative and futuristic in light of ongoing attacks that need to be dealt on a frequent basis, like phishing and ransomware.

But Anderson argues that ML systems shouldn't be considered in isolation. Rather, they should be thought of as cogs in a larger system that can be sabotaged, a scenario that has ramifications beyond the integrity of a specific ML model.

"For example, if an adversary wanted to commit expense fraud, she could do this by digitally altering real receipts to fool an automated system similar to the tabby cat and guacamole example," he said. "However, a much easier thing to do is to simply submit valid receipts to the automated system that do not represent legitimate business expenses."

In other words, securing ML models is a necessary step to defend against more commonplace risks.

The fate of Microsoft's Tay twitter chatbot illustrates why ML security should be seen as a practical matter rather than an academic exercise. Launched in 2016 as interactive entertainment, Tay had been programmed to learn language from user input. Within 24 hours, Tay was parroting toxic input from online trolls and was subsequently deactivated.

Call in the squad

Nowadays, having learned to take ML security seriously, Microsoft conducts red team exercises against its ML models. A red team exercise refers to an internal team playing the role of an attacking entity to test the target organization's defenses.

Anderson recounted one such red team inquiry conducted against an internal Azure resource provisioning service that uses ML to dole out virtual machines to Microsoft employees.

Microsoft relies on a web portal to allocate physical server space for virtualized computing resources. The savings for doing so at a company with over 160,000 employees can be significant, said Anderson.

Sick of AI engines scraping your pics for facial recognition? Here's a way to Fawkes them right up

"In our red team engagement, we took the role of an adversary who wishes to cause an indiscriminate denial of service attack through a so-called noisy neighbor attack, by tricking the system to deploy hungry resources on physical hardware that contained high availability service containers," he explained. "Evading the ML model for this is a linchpin for this exercise."

The goal of the exercise was to determine if the red team could cause a system availability violation through an ML integrity violation. And the exercise was conducted without direct access to the ML model being used to allocate computing resources.

During the reconnaissance phase of the exercise, the red team found that its credentials gave it access to two critical pieces of information, Anderson recounted. First was that the team had read-only access to training data for the model and second was that the team found details about how the model handled data featurization – the conversion of data into numerical vectors.

That alone, he explained, was enough to allow the red team to construct its own ML model to test its attack offline.

Using a replica ML model they were able to construct, the red team managed to identify a number of "evasive variants." These are inputs to the ML model that would guarantee when the model would predict a resource request would be excessive. Anderson said the team identified various resource configurations that could be requested to make the ML model see a friendly, low-resource container as one that's oversubscribed.

"Having those resource requests that would guarantee an oversubscribed condition, we can then instrument a virtual machine, for example, with hungry resource payloads, high CPU utilization, and memory usage that would then be over-provisioned and cause a denial of service to the other containers on the same physical host," he explained.

Anderson said there are several things that should be taken from this example. First, he said, internal ML models are not safe by default.

"Even though a model may not be directly accessible to the outside world, there are paths by which an attacker can exploit it to cause cascading downstream effects in an overall system," he said.

Second, he said, permissive access to data or code can lead to model theft.

"This seems really simple, but ask your data science team how they set up permissions around their data and their code," he said.

Third, check the model output before taking prescriptive actions, which is another way of saying include a sanity check in your system. That might mean auditing one out of every thousand outputs, including a human in the loop periodically, or simply implementing guardrails like never oversubscribing a 24-core VM no matter what the model predicts.

And fourth, make sure to log model behavior and development during training and deployment. "Even if there's no active program or person monitoring those logs in real time, we should always have an assumed breach mindset," Anderson said. ®

More about

Microsoft

More about

Microsoft

Narrower topics

Narrower topics

Broader topics

Bill Gates

TIP US OFF

Send us news

Topics

Special Features

Vendor Voice

Resources

AI + ML

Time to start taking machine-learning security seriously, Microsoft boffin insists

Forget academic adversarial attacks and imagine if ML were the weakest link in the chain

Call in the squad

Sick of AI engines scraping your pics for facial recognition? Here's a way to Fawkes them right up

More about

More about

Narrower topics

Broader topics

More about

More about

More about

Narrower topics

Broader topics

TIP US OFF

Other stories you might like

Researchers claim Windows Defender can be fooled into deleting databases

October 2025 will be a support massacre for a bunch of Microsoft products

Microsoft is a national security threat, says ex-White House cyber policy director

Industrial systems integrating digitalisation

Open source versus Microsoft: The new rebellion begins

Microsoft breach allowed Russian spies to steal emails from US government

Microsoft shrinks AI down to pocket size with Phi-3 Mini

Microsoft claims it didn't mean to inject Copilot into Windows Server 2022 this week

AI gold rush continues as Microsoft invests $1.5B in UAE's G42

Microsoft to use Windows 11 Start menu as a billboard with app ads for Insiders

Microsoft teases deepfake AI that's too powerful to release

Microsoft aims to triple datacenter capacity to fuel AI boom

About Us

Our Websites

Your Privacy