Microsoft chatbots: Sweet XiaoIce vs foul-mouthed Tay

Cultural differences, eh?


AI chatbots can act like social experiments, offering a glimpse into human culture – for the good or the bad.

Microsoft and Bing researchers found this out when they trialled their chatbots on China’s hugely successful messaging platform, WeChat, and on Twitter.

The Chinese chatbot, XiaoIce, went viral within 72 hours of launch and now has more than 40 million users in China and Japan.

Lili Cheng, distinguished engineer and general manager of Future Social Experiences Labs at Microsoft, was part of the team that built XiaoIce. Following the success in China, Microsoft decided to try it on US Twitter. “What could go wrong?” she said at a presentation at the O’Reilly Artificial Intelligence conference.

The audience laughed because they knew what went wrong. Microsoft’s Twitter bot, Tay, rapidly descended into a racist, sexist wreck. Tay was pulled from the internet and Microsoft issued an apology shortly after.

Whilst XiaoIce was "acting" cute – counting sheep to help users fall asleep, or recognising different breeds of dogs – Tay was busily denying the Holocaust.

Both chatbots had learned how to interact by mining the internet for conversations on social media. But Tay was manipulated into being offensive because it was attacked, said Peter Lee, Corporate Vice President at Microsoft Research.

“Unfortunately, in the first 24 hours of coming online, a coordinated attack by a subset of people exploited a vulnerability in Tay.”
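Microsoft never published the technical details of that vulnerability, though contemporary reports pointed to a "repeat after me" feature and to Tay folding unfiltered user messages back into its reply corpus. The hypothetical Python sketch below (no relation to Tay's actual code) illustrates why that design is so easy to poison, and how filtering input before learning blunts the attack:

    # Hypothetical sketch -- Microsoft has not released Tay's code; this only
    # illustrates why learning from unfiltered user input is exploitable.
    import random

    class NaiveLearningBot:
        """Learns its reply corpus directly from whatever users send it."""

        def __init__(self) -> None:
            self.corpus: list[str] = ["hello!"]  # seed conversations

        def handle(self, message: str) -> str:
            # A "repeat after me" feature lets an attacker plant arbitrary
            # text straight into the bot's output.
            if message.lower().startswith("repeat after me:"):
                reply = message.split(":", 1)[1].strip()
            else:
                reply = random.choice(self.corpus)
            # Every incoming message becomes future "training data", so a
            # coordinated group can flood the corpus with toxic content.
            self.corpus.append(message)
            return reply

    class ModeratedLearningBot(NaiveLearningBot):
        # Placeholder blocklist; production systems use trained classifiers.
        BLOCKLIST = {"offensive_term"}

        def handle(self, message: str) -> str:
            # Mitigation: filter before learning, not just before replying.
            if any(term in message.lower() for term in self.BLOCKLIST):
                return "Let's change the subject."
            return super().handle(message)

The point of the second class is where the moderation sits: in front of the learning loop. Screening only the bot's outputs leaves the corpus itself poisoned.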

Microsoft did not expect Tay to behave this way, but Cheng told The Register that, in hindsight, she understood why it did... and it wasn't necessarily because users in China were kinder.

“Twitter has a lot of trolls,” Cheng said. “Even if negative, America strongly believes in free speech, which is enshrined in its constitution. In China, however, there is less freedom: the government controls the internet and goes as far as censoring particular words online.”

Words such as "oppression" are blocked in China, but on Twitter they are used freely – often bandied about during intense debate, and of course trolling, whenever users post about feminism or racial issues.

"Our cultural stories shape how we interact with AI. Addressing societal bias is critical for us to better design conversational AI and deepens my optimism in AI and the advances we'll experience as more people have access to AI tools such as Microsoft Botframework and many more," Cheng told The Register. ®
