Google Cloud's API knows the sort of thing you like to look at

And now has creepy 'entity' sentiment analysis too


Google Cloud's Natural Language API has become a bit more, er, insightful: it can now sort content into 700 different categories, such as Health, Hobbies & Leisure and Law & Government.

This clustering allows businesses to tailor the info they deliver to customers to match those customers' preferences. Google also added the ability to determine whether a specific entity in text is spoken about in a positive or negative light (previously the API could only find the sentiment in sentences or blocks of text).

While Google says text-clustering would help publishers such as Hearst "understand what their audience is reading and how their content is being used" it might not be all sunny skies. Researchers warn that it raises some privacy concerns.

"If I know the tweets and news and other texts you consume, and I cluster them, then I can very quickly determine your set of interests / sentiments / whatever clustering regime is applied," Eduard Hovy, a natural language processing expert at Carnegie Mellon University in Pittsburgh, Pennsylvania, told The Register.

Tailoring content to preferences could help customers quickly find relevant info, he said, but "on the down side, this creates confirmation bubbles that lead you to believe that the whole world thinks like you do, and that any opposite views you might encounter are just random crazies out there, not the majority."

He added on a more general level that there are a lot of things that can be deduced by clustering user preferences that could be used in unsavoury ways by businesses and governments or ransomed by hackers.

And although businesses might argue they're targeting "classes" of people, not individuals, if enough clusters intersect and more data is extracted, "you can sometimes pinpoint an exact home," he said.

"And further on the down side, [collating such data] means a hacker or a government that gets hold of it might know more about you than you’d like, potentially for blackmail, insurance denial, etc.

"The question is: do you trust Google? Or Amazon? Or whoever it is who knows what text/images/video you consume?" he said.

Google has not responded to a request for comment. ®

Similar topics

Broader topics


Other stories you might like

  • Experts: AI should be recognized as inventors in patent law
    Plus: Police release deepfake of murdered teen in cold case, and more

    In-brief Governments around the world should pass intellectual property laws that grant rights to AI systems, two academics at the University of New South Wales in Australia argued.

    Alexandra George, and Toby Walsh, professors of law and AI, respectively, believe failing to recognize machines as inventors could have long-lasting impacts on economies and societies. 

    "If courts and governments decide that AI-made inventions cannot be patented, the implications could be huge," they wrote in a comment article published in Nature. "Funders and businesses would be less incentivized to pursue useful research using AI inventors when a return on their investment could be limited. Society could miss out on the development of worthwhile and life-saving inventions."

    Continue reading
  • Declassified and released: More secret files on US govt's emergency doomsday powers
    Nuke incoming? Quick break out the plans for rationing, censorship, property seizures, and more

    More papers describing the orders and messages the US President can issue in the event of apocalyptic crises, such as a devastating nuclear attack, have been declassified and released for all to see.

    These government files are part of a larger collection of records that discuss the nature, reach, and use of secret Presidential Emergency Action Documents: these are executive orders, announcements, and statements to Congress that are all ready to sign and send out as soon as a doomsday scenario occurs. PEADs are supposed to give America's commander-in-chief immediate extraordinary powers to overcome extraordinary events.

    PEADs have never been declassified or revealed before. They remain hush-hush, and their exact details are not publicly known.

    Continue reading
  • Stolen university credentials up for sale by Russian crooks, FBI warns
    Forget dark-web souks, thousands of these are already being traded on public bazaars

    Russian crooks are selling network credentials and virtual private network access for a "multitude" of US universities and colleges on criminal marketplaces, according to the FBI.

    According to a warning issued on Thursday, these stolen credentials sell for thousands of dollars on both dark web and public internet forums, and could lead to subsequent cyberattacks against individual employees or the schools themselves.

    "The exposure of usernames and passwords can lead to brute force credential stuffing computer network attacks, whereby attackers attempt logins across various internet sites or exploit them for subsequent cyber attacks as criminal actors take advantage of users recycling the same credentials across multiple accounts, internet sites, and services," the Feds' alert [PDF] said.

    Continue reading

Biting the hand that feeds IT © 1998–2022