28 September 2022
Posted by Rakesh Iyer, Staff Software Engineer and Leland Rechis, Group Product Manager
We are upgrading the Speech Services by Google speech engine in a big way, providing clearer, more natural voices. All 421 voices in 67 languages have been upgraded with a new voice model and synthesizer.
If you already use TTS and the Speech Services by Google engine, there is nothing to do – everything will happen behind the scenes as your users will have automatically downloaded the latest update. We’ve seen a significant side by side quality increase with this change, particularly in respects to clarity and naturalness.
With this upgrade we will also be changing the default voice in en-US to one that is built using fresher speaker data, which alongside our new stack, results in a drastic improvement. If your users have not selected a system voice, and you rely on system defaults, they will hear a slightly different speaker. You can hear the difference below
Speaker change and upgrade for EN-US
Sample Current Speaker |
Sample Upgraded Speaker |
|
|
Speaker upgrades in a few other languages
Current |
Upgraded |
HI-IN |
HI-IN |
|
|
PT-BR |
PT-BR |
|
|
ES-US |
ES-US |
|
|
This update will be rolling out to all 64 bit Android devices via the Google Play Store over the next few weeks as a part of the Speech Services by Google apk. If you are concerned your users have not updated this yet, you can check for the minimum version code ,210390644 on the package com.google.android.tts.
If you haven't used TTS in your projects yet, or haven’t given your users the ability to choose a voice within your app, it's fairly straightforward, and easy to experiment with. We’ve included some sample code to get you started.
Here’s an example of how to set up voice synthesis, get a list of voices, and set a specific voice. We finally send a simple utterance to the synthesizer.
class MainActivity : AppCompatActivity() { |
We are excited to see this upgraded experience in your app!