Can your AI code be fooled by vandalized images or clever wording? Microsoft open sources a tool to test for that

Counterfit automatically creates adversarial inputs to find weaknesses

Microsoft this week released a Python tool that probes AI models to see if they can be hoodwinked by malicious input data.

And by that, we mean investigating whether, say, an airport's object-recognition system can be fooled into thinking a gun is a hairbrush, or a bank's machine-learning-based anti-fraud code can be made to approve dodgy transactions, or a web forum moderation bot can be tricked into allowing through banned hate speech.

The Windows giant's tool, dubbed Counterfit, is available on GitHub under the MIT license, and is command-line controlled. Essentially, the script can be instructed to delve into a sizable toolbox of programs that automatically generate thousands of adversarial inputs for a given AI model under test. If the model's output for a given input differs from what was expected, this is recorded as a successful attack.

For example, if a model is shown a slightly altered picture of a car and it predicts it's a pedestrian, that's a win for Counterfit and a vulnerability in the model has been identified. The goal is to reveal any weak spots in the machine-learning system under test.
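
To make that pass-fail check concrete, here's a minimal sketch of the idea in Python – the model, perturb, and log names are illustrative stand-ins, not Counterfit's actual API:

    import numpy as np

    def record_successful_attacks(model, inputs, perturb, log):
        """Hypothetical helper, not Counterfit's API: an attack counts as a
        win when the model's prediction on a perturbed input no longer
        matches its prediction on the original."""
        for x in inputs:
            clean_label = np.argmax(model.predict(x[np.newaxis, ...]))
            x_adv = perturb(x)  # adversarially modified copy of the input
            adv_label = np.argmax(model.predict(x_adv[np.newaxis, ...]))
            if adv_label != clean_label:  # the output flipped: weakness found
                log.append({"clean": int(clean_label), "adversarial": int(adv_label)})
        return log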

You can test models you've trained yourself, or black-box models you've acquired from network edge devices, mobile applications, or academic projects. Inputs can be text – for probing, say, sentiment-analysis systems – images for computer-vision apps, or audio for, say, transcription services.

For instance, Counterfit's documentation and code include a tutorial involving a pretrained model taught to identify handwritten numbers from the MNIST database of scribbles. The tutorial shows you how to set up Counterfit to use the so-called Hop-Skip-Jump technique, implemented in the Adversarial Robustness Toolbox, to slightly modify a picture of the number 5 so that the model thinks it's a 3.
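
For readers who want to see roughly what that looks like in code, here is a minimal sketch that drives the same HopSkipJump attack through the Adversarial Robustness Toolbox directly, rather than via Counterfit's command set. It assumes you already have a trained PyTorch MNIST classifier called net and a NumPy array of test images x_test – both are stand-ins for whatever model and data you're probing:

    import numpy as np
    import torch
    from art.attacks.evasion import HopSkipJump
    from art.estimators.classification import PyTorchClassifier

    # Wrap the trained network (assumed to exist as `net`) so ART can query it
    classifier = PyTorchClassifier(
        model=net,
        loss=torch.nn.CrossEntropyLoss(),
        input_shape=(1, 28, 28),
        nb_classes=10,
        clip_values=(0.0, 1.0),
    )

    # HopSkipJump is a decision-based attack: it only needs the model's
    # predicted labels, which is why it suits black-box targets
    attack = HopSkipJump(classifier, targeted=False, max_iter=20, max_eval=1000)

    x = x_test[:1].astype(np.float32)  # assumed: one 1x28x28 image of a handwritten 5
    x_adv = attack.generate(x=x)

    print("clean prediction:      ", np.argmax(classifier.predict(x)))
    print("adversarial prediction:", np.argmax(classifier.predict(x_adv)))

If the two printouts differ – a 5 misread as a 3, say – Counterfit would count that as a successful attack.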

“This tool was born out of our own need to assess Microsoft’s AI systems for vulnerabilities with the goal of proactively securing AI services, in accordance with Microsoft’s responsible AI principles and Responsible AI Strategy in Engineering (RAISE) initiative,” Redmond's Will Pearce and Ram Shankar Siva Kumar said in a blog post.

“Counterfit started as a corpus of attack scripts written specifically to target individual AI models, and then morphed into a generic automation tool to attack multiple AI systems at scale.”

A record of successful attempts to fool the given model under test is logged so developers can inspect the inputs to see where they need to shore up their software – or so attackers and penetration-testers can identify where and how to hit a program.

Microsoft said its own AI red team uses Counterfit to probe the tech giant's algorithms in production, and is looking for ways to adapt the tool so it can automatically scan models for vulnerabilities before they’re deployed. ®
