AI + ML

This article is more than 1 year old

Yahoo! couldn't! detect! hackers! in! its! network! but! can! spot! NSFW! smut! in! your! office?

Web giant offers open-source AI-powered X-rated pic hunter

Thomas Claburn

Fri 30 Sep 2016 // 18:56 UTC

Having laid bare over half a billion usernames and passwords through meager funding and witless indifference, Yahoo! is putting its faith in artificial intelligence to protect people from bare skin.

Yahoo! engineers Jay Mahadeokar and Gerry Pesavento in a blog post on Friday said the company has released an open-source model for detecting images deemed "not safe for work" (NSFW).

"To the best of our knowledge, there is no open source model or algorithm for identifying NSFW images," the pair wrote. "In the spirit of collaboration and with the hope of advancing this endeavor, we are releasing our deep learning model that will allow developers to experiment with a classifier for NSFW detection, and provide feedback to us on ways to improve the classifier."

Censorship has been something of a losing proposition for Yahoo!, from its role in the imprisonment of Chinese journalist Shi Tao over a decade ago to its over-enthusiastic spam filters. Nonetheless, the researchers argue that the prevalence of user-generated content makes filtering NSFW images essential for web and mobile applications.

It may be essential for business models that rely on free labor producing content under the pretense of sharing, but that turns out to describe quite a number of internet companies. Alternatively, this software is going to be great for finding and identifying raunchy material on the web.

Yahoo!'s software is a neural network model for Caffe, a deep-learning framework. There are other frameworks that experts in the field rate more highly, such as Torch. Yahoo! also relies on CaffeOnSpark, a framework for running Caffe on Hadoop and Spark clusters.

The NSFW model is designed to take an image and output a smut probability between zero and one, though Mahadeokar and Pesavento note, "we do not provide guarantees of accuracy of output."

Yahoo!'s researchers have declined to release their training images "due to the nature of the data," leaving readers to task of amassing a sufficiently large cache of indiscreet pictures to allow their computers to categorize what they're seeing accurately.

One source might be Google's newly released Open Images library, a dataset of some 9 million URLs pointing at images which may or may not be subject to a Creative Commons Attribution license – Google advises verifying the licensing status of each image. Google intends for its dataset, produced in collaboration with CMU and Cornell universities, to help train neural networks.

We're still waiting to hear from Google whether there are any images in the dataset that would warrant a Yahoo exclamation mark. ®

More about

×

More about

Narrower topics

Broader topics

Self-driving Car

More about

More about

×

More about

Narrower topics

Broader topics

Self-driving Car

TIP US OFF

Other stories you might like

Google Cloud chief is really psyched about this AI thing

Cloud Next We're on a highway to ML

PaaS + IaaS9 Apr 2024 | 6

AI spam is winning the battle against search engine quality

interview 'Not all AI content is spam, but I think right now all spam is AI content'

AI + ML13 Apr 2024 | 35

Arm flexes silicon muscles to push generative AI at the edge

Ethos-U85 microNPU boasts 4x performance boost over previous gen

Systems9 Apr 2024 | 1

Protecting distributed branch office environments from ransomware

As ransomware becomes more sophisticated, detection tools should be upgraded to cover every site and location

Sponsored Feature

What's up with AI lately? Let's start with soaring costs, public anger, regulations...

'Obtaining genuine consent for training data collection is especially challenging' industry sages say

AI + ML15 Apr 2024 | 11

Developers are calling the shots on AI planning, judging by your experience

Reader Poll Results And American CIOs keep a closer eye on the purse strings than European equivalents

AI + ML8 Apr 2024 |

Psst, hey. It's the NSA. You want some AI security advice?

You can trust us, we're the good guys

AI + ML17 Apr 2024 | 1

Why making pretend people with AGI is a waste of energy

Interview Industrial revolution didn't give us human mimics, so why should AI think like us, this computer scientist wonders

AI + ML14 Apr 2024 | 60

Intel CEO suggests AI can help to create a one-person Unicorn

Intel Vision And possibly replace entire business units too

AI + ML10 Apr 2024 | 37

Microsoft puts ex-DeepMind boffin in charge of London AI hub

Follows £2.5 billion pledge to 'upskill' British workers for the new world order

AI + ML8 Apr 2024 | 2

US House mulls forcing AI makers to reveal use of copyrighted training data

Proposed law doesn't include any ban on use of such stuff to build models, mind you

AI + ML10 Apr 2024 | 16

Hailo's latest AI chip shows up integrated NPUs and sips power like fine wine

All your PC needs for 40 TOPS is an M.2 slot

Systems10 Apr 2024 | 6

The Register Biting the hand that feeds IT

About Us

Our Websites

Your Privacy

Situation Publishing

Copyright. All rights reserved © 1998–2024