Amazon can't channel the dead, but its deepfake voices take a close second

Megacorp shows Alexa speaking like kid's deceased grandma

Thu 23 Jun 2022 // 13:46 UTC

In the latest episode of Black Mirror, a vast megacorp sells AI software that learns to mimic the voice of a deceased woman whose husband sits weeping over a smart speaker, listening to her dulcet tones.

Only joking – it's Amazon, and this is real life. The experimental feature of the company's virtual assistant, Alexa, was announced at an Amazon conference in Las Vegas on Wednesday.

Rohit Prasad, head scientist for Alexa AI, described the tech as a means to build trust between human and machine, enabling Alexa to "make the memories last" when "so many of us have lost someone we love" during the pandemic.

In an explanatory video, Amazon showed a child asking: "Alexa, can Grandma finish reading me The Wizard of Oz?" at which point the assistant's normally artificial voice shifted gears into a softer, more natural timbre. The point being that it's supposed to convincingly sound like the kid's grandma.

Though there was scant detail as to when or even if the technology would become publicly available, Prasad said Amazon was able to train the system to mimic a voice based on about a minute of recorded dialog, meaning users could potentially do this themselves at home.

It's an interesting use case on the face of it, and certainly a strange way to present it. That's because Amazon has not found a way to "channel the dead" – it has merely developed an in-house deepfake for voices.

Deepfakes are manipulated video images, strung together by an AI from a variety of footage, to show their subject – a celebrity, politician, anyone – doing or saying something they have never in reality done.

Most examples of a politician being deepfaked to say something inflammatory would have been aligned with an audio recording of someone who can do a decent impression of the subject. AI voice mimicry could potentially make deepfake footage all the more convincing.

The field around AI-manipulated voice is growing. Microsoft has its own take on voice mimicry, ostensibly to help restore impaired people's speech, but was concerned about the potential for abuse. The software could reproduce a voice based on a short sample, like Amazon's, yet it has remained in limbo for years while company leaders wrestled with its ethical implications.

Meanwhile, a startup called Sanas recently dropped out of stealth with $32 million in funding. The idea behind its technology is to transform the speaker's accent into something closer to home for the recipient – for example, a sales droid from a call center in France speaking with an Alabama accent for their customer in that state. ®
