Yahoo! couldn't! detect! hackers! in! its! network! but! can! spot! NSFW! smut! in! your! office?

Web giant offers open-source AI-powered X-rated pic hunter


Having laid bare over half a billion usernames and passwords through meager funding and witless indifference, Yahoo! is putting its faith in artificial intelligence to protect people from bare skin.

Yahoo! engineers Jay Mahadeokar and Gerry Pesavento in a blog post on Friday said the company has released an open-source model for detecting images deemed "not safe for work" (NSFW).

"To the best of our knowledge, there is no open source model or algorithm for identifying NSFW images," the pair wrote. "In the spirit of collaboration and with the hope of advancing this endeavor, we are releasing our deep learning model that will allow developers to experiment with a classifier for NSFW detection, and provide feedback to us on ways to improve the classifier."

Censorship has been something of a losing proposition for Yahoo!, from its role in the imprisonment of Chinese journalist Shi Tao over a decade ago to its over-enthusiastic spam filters. Nonetheless, the researchers argue that the prevalence of user-generated content makes filtering NSFW images essential for web and mobile applications.

It may be essential for business models that rely on free labor producing content under the pretense of sharing, but that turns out to describe quite a number of internet companies. Alternatively, this software is going to be great for finding and identifying raunchy material on the web.

Yahoo!'s software is a neural network model for Caffe, a deep-learning framework. There are other frameworks that experts in the field rate more highly, such as Torch. Yahoo! also relies on CaffeOnSpark, a framework for running Caffe on Hadoop and Spark clusters.

The NSFW model is designed to take an image and output a smut probability between zero and one, though Mahadeokar and Pesavento note, "we do not provide guarantees of accuracy of output."

Yahoo!'s researchers have declined to release their training images "due to the nature of the data," leaving readers to task of amassing a sufficiently large cache of indiscreet pictures to allow their computers to categorize what they're seeing accurately.

One source might be Google's newly released Open Images library, a dataset of some 9 million URLs pointing at images which may or may not be subject to a Creative Commons Attribution license – Google advises verifying the licensing status of each image. Google intends for its dataset, produced in collaboration with CMU and Cornell universities, to help train neural networks.

We're still waiting to hear from Google whether there are any images in the dataset that would warrant a Yahoo exclamation mark. ®

Similar topics


Other stories you might like

  • Prisons transcribe private phone calls with inmates using speech-to-text AI

    Plus: A drug designed by machine learning algorithms to treat liver disease reaches human clinical trials and more

    In brief Prisons around the US are installing AI speech-to-text models to automatically transcribe conversations with inmates during their phone calls.

    A series of contracts and emails from eight different states revealed how Verus, an AI application developed by LEO Technologies and based on a speech-to-text system offered by Amazon, was used to eavesdrop on prisoners’ phone calls.

    In a sales pitch, LEO’s CEO James Sexton told officials working for a jail in Cook County, Illinois, that one of its customers in Calhoun County, Alabama, uses the software to protect prisons from getting sued, according to an investigation by the Thomson Reuters Foundation.

    Continue reading
  • Battlefield 2042: Please don't be the death knell of the franchise, please don't be the death knell of the franchise

    Another terrible launch, but DICE is already working on improvements

    The RPG Greetings, traveller, and welcome back to The Register Plays Games, our monthly gaming column. Since the last edition on New World, we hit level cap and the "endgame". Around this time, item duping exploits became rife and every attempt Amazon Games made to fix it just broke something else. The post-level 60 "watermark" system for gear drops is also infuriating and tedious, but not something we were able to address in the column. So bear these things in mind if you were ever tempted. On that note, it's time to look at another newly released shit show – Battlefield 2042.

    I wanted to love Battlefield 2042, I really did. After the bum note of the first-person shooter (FPS) franchise's return to Second World War theatres with Battlefield V (2018), I stupidly assumed the next entry from EA-owned Swedish developer DICE would be a return to form. I was wrong.

    The multiplayer military FPS market is dominated by two forces: Activision's Call of Duty (COD) series and EA's Battlefield. Fans of each franchise are loyal to the point of zealotry with little crossover between player bases. Here's where I stand: COD jumped the shark with Modern Warfare 2 in 2009. It's flip-flopped from WW2 to present-day combat and back again, tried sci-fi, and even the Battle Royale trend with the free-to-play Call of Duty: Warzone (2020), which has been thoroughly ruined by hackers and developer inaction.

    Continue reading
  • American diplomats' iPhones reportedly compromised by NSO Group intrusion software

    Reuters claims nine State Department employees outside the US had their devices hacked

    The Apple iPhones of at least nine US State Department officials were compromised by an unidentified entity using NSO Group's Pegasus spyware, according to a report published Friday by Reuters.

    NSO Group in an email to The Register said it has blocked an unnamed customers' access to its system upon receiving an inquiry about the incident but has yet to confirm whether its software was involved.

    "Once the inquiry was received, and before any investigation under our compliance policy, we have decided to immediately terminate relevant customers’ access to the system, due to the severity of the allegations," an NSO spokesperson told The Register in an email. "To this point, we haven’t received any information nor the phone numbers, nor any indication that NSO’s tools were used in this case."

    Continue reading

Biting the hand that feeds IT © 1998–2021