Can YOU tell the difference between a human voice and a robot?

As technology progresses, computer-generated voices are getting more realistic every single day. I’ll explore how the technology evolved, its current uses, and how it could be misused in the future.

You might not realise it, but computer-generated voices are everywhere, from voice assistants like Siri to text-to-speech functions on apps like TikTok. With AI progressing really fast, computer-generated voices are now realistic, but are they becoming too realistic?

Computer-generated voices are now mostly used for accessibility. Text-to-speech on iOS’ VoiceOver feature lets the blind browse the Internet and use social media like Facebook and Twitter. It’s also used by the telecommunications industry. Every time you call a helpdesk and there’s a robot speaking, that’s the same technology going on. The most impressive use so far is definitely Google Duplex, which can call and book appointments for you with a computer voice that sounds eerily natural.

To understand how we got here, let’s go back a couple of decades to see how the technology developed.

Christian Gottlieb Kratzenstein. Source

The first case of synthesized speech can be tracked all the way back to 1779 when scientist Christian Gottlieb Kratzenstein built a model of a human vocal tract that could produce the five long vowel sounds.

Fast-forward to the electronic age, we get speech synthesiser chips like in the Speak & Spell toys that now can actually say words, even though it sounds quite robotic. And then we reach the digital age, where we get computer-generated pop stars like Hatsune Miku and insane celebrity deepfakes.

You might ask “How did these voices get so good?”. The answer lies in emotion. As humans, we don’t really speak perfectly. There are a lot of imperfections. We need to take breaths, or say uhm and ah. So the key to making computer speech sound realistic is by adding those imperfections in.

Sonantic is a company that specialises in making AI voices that are expressive. For example, Siri and TikTok only use words for the input, so the results are pretty flat. Sonantic, on the other hand, allows you to specify the emotion, pacing, and even the pitch contour of the lines. When you add in breaths and background music, it’s almost impossible to tell the difference between a computer and a human.

At this point, I think it’s clear that this technology is not something to be taken lightly. Even though they can be used for good, AI voices can be quite dangerous when put in the wrong hands.

Two years ago, scammers used AI to mimic a CEO’s voice and actually got about a million ringgit out of the company. And recently, people are mimicking celebrities without their permission too. In a documentary about the late chef Anthony Bourdain, the filmmaker used AI to make Anthony say words that he’s never said out loud before, which sparked a lot of discussions about the ethics of it all.

We can’t stop the technology from progressing, but what we can do is learn to adapt and develop policies to prevent the abuse of AI voices.

So, what do you think? Are you optimistic for the future of AI voices, or are you scared of what it might bring? Let us know in the comments section!

Recent Posts

JomCharge x DBKL turn on EV chargers at McDonald’s Sri Petaling

JomCharge x DBKL street-level EV charger deployment continues and the latest location is in Sri…

9 hours ago

Can you and your family enjoy a 100% electric drive without ever plugging in?

This post is brought to you by Nissan. For many Malaysian families, the idea of…

23 hours ago

Gentari’s largest EV Charging Hub in Penang, 540kW total capacity with 6 bays at Bayan Baru

Besides deploying more DC Chargers in Penang Island in partnership with MBPP, Gentari has just…

1 day ago

BMW 7 Series gets Neue Klasse upgrade. New i7 now offers over 700km range and 250kW DC fast charging

BMW has officially revealed the updated 7th generation BMW 7 Series (G70), and this isn’t…

2 days ago

Oppo Find X9s goes official in Malaysia: Triple 50MP Hasselblad cameras, Dimensity 9500s, 6.59″ AMOLED, priced at RM3,899

Aside from the big boss Find X9 Ultra, Oppo Malaysia has also introduced another member…

2 days ago

Honor 600 series launched in Malaysia: Snapdragon 8 Elite, 200MP camera, 7,000mAh battery, priced from RM2,599

The Honor 600 and Honor 600 Pro have finally made their launch in Malaysia, making…

2 days ago

This website uses cookies.