Image for post
Image for post

Google announced another move forward in text to speech synthesis, Tacotron2. This adds emphasis and prosody and better pronunciation to TTS. AI is now going to be better at determining the proper way to say something written in English than a human.

We’re going to see more of these milestones over the next year. The next will be that a majority of people will be unable to determine the difference between TTS and human read speech (at least for short snippets). This is now going to make things very spooky and could open the door for bots to call places on our behalf.

Independent daily thoughts on all things future, voice technologies and AI. More at

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store