Microsoft: Our AI speech recognition mangles your words the least

Cortana may be due an upgrade


Microsoft researchers working on AI computer speech recognition have reached a word error rate of 6.3 per cent, claiming to be the lowest in the industry.

Hot on the heels of Google DeepMind announcing a “breakthrough” in AI speech recognition, Microsoft was quick to respond by saying it, too, has reached a “milestone” while using neural networks.

A paper released on arXiv shows the researchers have combined “neural-network-based acoustic and language modelling” on the US National Institute of Standards and Technology (NIST) 2000 Switchboard task - a conversational telephone speech recognition test used as an industry standard.

Artificial neural networks which are modelled loosely on how scientists believe the brain performs calculations are often used in speech processing.

Microsoft has used a mixture of convolutional neural networks that don’t form a closed cycle of nodes and a recurrent neural network that does. Both are useful for analysing the large data sets used to train the networks to process language, which requires a lot of computing power.

Human language is difficult for computers to understand. To make it easier, models are used to predict the probability distribution over sequences of words using audio signals from speech.

Microsoft uses a 30k-word vocabulary database derived from the most common words in the Switchboard and Fisher corpora to recognise words from speech.

Computer speech recognition has come a long way. Twenty years ago the error rate of the best published research system had a word error rate that was greater than 43 percent, Microsoft said.

Speech recognition is a hot trend in AI as companies race to build the best AI personal assistant. Microsoft has Cortana, Apple uses Siri, Google has its WaveNet system and Amazon has just announced a UK release for its Echo device. ®

Broader topics


Other stories you might like

  • Robotics and 5G to spur growth of SoC industry – report
    Big OEMs hogging production and COVID causing supply issues

    The system-on-chip (SoC) side of the semiconductor industry is poised for growth between now and 2026, when it's predicted to be worth $6.85 billion, according to an analyst's report. 

    Chances are good that there's an SoC-powered device within arm's reach of you: the tiny integrated circuits contain everything needed for a basic computer, leading to their proliferation in mobile, IoT and smart devices. 

    The report predicting the growth comes from advisory biz Technavio, which looked at a long list of companies in the SoC market. Vendors it analyzed include Apple, Broadcom, Intel, Nvidia, TSMC, Toshiba, and more. The company predicts that much of the growth between now and 2026 will stem primarily from robotics and 5G. 

    Continue reading
  • Deepfake attacks can easily trick live facial recognition systems online
    Plus: Next PyTorch release will support Apple GPUs so devs can train neural networks on their own laptops

    In brief Miscreants can easily steal someone else's identity by tricking live facial recognition software using deepfakes, according to a new report.

    Sensity AI, a startup focused on tackling identity fraud, carried out a series of pretend attacks. Engineers scanned the image of someone from an ID card, and mapped their likeness onto another person's face. Sensity then tested whether they could breach live facial recognition systems by tricking them into believing the pretend attacker is a real user.

    So-called "liveness tests" try to authenticate identities in real-time, relying on images or video streams from cameras like face recognition used to unlock mobile phones, for example. Nine out of ten vendors failed Sensity's live deepfake attacks.

    Continue reading
  • Lonestar plans to put datacenters in the Moon's lava tubes
    How? Founder tells The Register 'Robots… lots of robots'

    Imagine a future where racks of computer servers hum quietly in darkness below the surface of the Moon.

    Here is where some of the most important data is stored, to be left untouched for as long as can be. The idea sounds like something from science-fiction, but one startup that recently emerged from stealth is trying to turn it into a reality. Lonestar Data Holdings has a unique mission unlike any other cloud provider: to build datacenters on the Moon backing up the world's data.

    "It's inconceivable to me that we are keeping our most precious assets, our knowledge and our data, on Earth, where we're setting off bombs and burning things," Christopher Stott, founder and CEO of Lonestar, told The Register. "We need to put our assets in place off our planet, where we can keep it safe."

    Continue reading

Biting the hand that feeds IT © 1998–2022