Google has announced another step forward in text-to-speech synthesis: Tacotron 2. It adds emphasis, prosody, and better pronunciation to TTS. AI is now going to be better than a human at determining the proper way to say something written in English.

We’re going to see more of these milestones over the next year. The next will be that a majority of people can no longer tell the difference between TTS and human-read speech (at least for short snippets). That is going to make things very spooky, and it could open the door for bots to call places on our behalf.

