This page documents production updates to Cloud TTS. You can periodically check this page for announcements about new or updated features, bug fixes, known issues, and deprecated functionality.
You can see the latest product updates for all of Google Cloud on the Google Cloud page, browse and filter all release notes in the Google Cloud console, or programmatically access release notes in BigQuery.
To get the latest product updates delivered to you, add the URL of this page to your feed reader, or add the feed URL directly.
November 10, 2025
Chirp 3: HD voices now support speech synthesis in the following languages: bg-BG, cs-CZ, et-EE, el-GR, he-IL, hr-HR, hu-HU, lt-LT, lv-LV, ro-RO, sk-SK, sl-SI, and sr-RS. For more information, see Chirp 3: Language Availability.
November 07, 2025
Gemini TTS now supports synthesis for streaming requests. For more information, see the Gemini TTS page.
October 17, 2025
Chirp 3 HD now supports speech synthesis using SSML input. Supported SSML tags are: <phoneme>, <p>, <s>, <sub>, and <say-as>. For more information, see Chirp 3 HD: SSML support.
September 30, 2025
Gemini-TTS is generally available (GA) and provides support for 30 voices and over 70 locales. You can synthesize single or multi-speaker speech from short snippets to long-form narratives. You can precisely dictate style, accent, pace, tone, and even emotional expression using natural-language prompts.
For more information, see Gemini TTS. Give it a try in Media Studio.
September 15, 2025
Chirp 3: HD voices is available on the asia-northeast1 endpoint. For more information, see Chirp 3: HD voices.
August 21, 2025
Chirp 3: Instant custom voice supports new input audio encodings PCM, MP3, and M4A, with any sample rate. For more information, see the Chirp 3: Instant custom voice page.
July 24, 2025
Chirp 3: HD voices now offers General Availability (GA) support for four additional Nordic languages: Danish (da-DK), Finnish (fi-FI), Norwegian Bokmål (nb-NO), and Swedish (sv-SE). For more information, see Chirp 3: HD voices.
April 16, 2025
The polyglot voices feature is only supported in multi-regions.
March 27, 2025
Chirp 3: HD voices in en-US now support experimental features for pace and pause controls.
March 17, 2025
Chirp 3: HD voices are only available in the global, us, eu, and asia-southeast1 regions. To use these voices, switch your endpoint to a supported region.
March 06, 2025
Chirp 3: HD voices now supports 8 new speakers in 31 new locales: ar-XA, bn-IN, cmn-CN, de-DE, en-AU, en-GB, en-IN, en-US, es-ES, es-US, fr-CA, fr-FR, gu-IN, hi-IN, id-ID, it-IT, ja-JP, kn-IN, ko-KR, ml-IN, mr-IN, nl-NL, pl-PL, pt-BR, ru-RU, sw-KE, ta-IN, te-IN, th-TH, tr-TR, and vi-VN.
February 10, 2025
Journey voices have been rebranded as Chirp HD voices.
November 22, 2024
Cloud TTS Journey voices have been updated to improve the accuracy of generated speech. This means you should notice fewer instances of dropped words.
November 11, 2024
Journey Voices now supports the de-de, en-gb, en-in, es-us, fr-ca, fr-fr, and it-it locales.
September 10, 2024
Journey Voices is now in Preview and supports text streaming.
February 26, 2024
Studio voices are now GA.
November 06, 2023
As of November 13 2023, speaker en-US-Studio-M will no longer be available. All requests sent to en-US-Studio-M will be routed to speaker en-US-Studio-Q. There is no action needed.
October 25, 2023
Styles are now supported in Neural2 voices through SSML. The following styles are supported
<google:emotion name="apologetic"><google:emotion name="calm"><google:emotion name="empathetic"><google:emotion name="firm"><google:emotion name="lively">
- en-us-Neural2-F
- en-us-Neural2-J
October 24, 2023
Long Audio Synthesis now supports Studio voices.
Long Audio Synthesis now supports SSML inputs.
October 16, 2023
There is no longer billing differentiation for Cloud Text-to-Speech Offline Custom Voice API calls. See the <ReportedUsage> documentation for more details.
January 10, 2023
On or after July 9th, 2023, Cloud Text-to-Speech will replace the following voices with new voices of similar quality and accent. The new voices are available to try now. No action will be needed from you to switch to the new voice on July 9th, 2023. However, you are free to switch to the new voice at any time.
- Removing ml-IN-Standard-A
- Redirecting to ml-IN-Standard-C
- Removing ml-IN-Wavenet-A
- Redirecting ml-IN-Wavenet-C
- Removing ml-IN-Standard-B
- Redirecting to ml-IN-Standard-D
- Removing ml-IN-Wavenet-B
- Redirecting ml-IN-Wavenet-D
- Removing bn-IN-Standard-A
- Redirecting to bn-IN-Standard-C
- Removing bn-IN-Wavenet-A
- Redirecting bn-IN-Wavenet-C
- Removing bn-IN-Standard-B
- Redirecting to bn-IN-Standard-D
- Removing bn-IN-Wavenet-B
- Redirecting bn-IN-Wavenet-D
- Removing kn-IN-Standard-A
- Redirecting to kn-IN-Standard-C
- Removing kn-IN-Wavenet-A
- Redirecting kn-IN-Wavenet-C
- Removing kn-IN-Standard-B
- Redirecting to kn-IN-Standard-D
- Removing kn-IN-Wavenet-B
- Redirecting kn-IN-Wavenet-D
- Removing gu-IN-Standard-A
- Redirecting to gu-IN-Standard-C
- Removing gu-IN-Wavenet-A
- Redirecting gu-IN-Wavenet-C
- Removing gu-IN-Standard-B
- Redirecting to gu-IN-Standard-D
- Removing gu-IN-Wavenet-B
- Redirecting gu-IN-Wavenet-D
- Removing it-IT-Standard-A
- Redirecting to it-IT-Standard-B
- Removing it-IT-Wavenet-A
- Redirecting to it-IT-Wavenet-B
- Removing es-ES-Standard-A
- Redirecting to es-ES-Standard-C
December 22, 2022
Text-to-Speech now offers these new news reading voices. See the supported voices page for a complete list of voices and audio samples.
- es-US-News-D
- es-US-News-E
- es-US-News-F
- es-US-News-G
- en-AU-News-E
- en-AU-News-F
- en-AU-News-G
- en-GB-News-G
- en-GB-News-H
- en-GB-News-I
- en-GB-News-J
- en-GB-News-K
- en-GB-News-L
- en-GB-News-M
November 10, 2022
Text-to-Speech now offers these new voices. See the supported voices page for a complete list of voices and audio samples.
- en-US-News-K
- en-US-News-L
- en-US-News-M
- en-US-News-N
October 07, 2022
On or after April 8th, 2023, Cloud Text-to-Speech will replace the following voices with new voices of similar quality and accent. The new voices are available to try now. No action will be needed from you to switch to the new voice on April 8th, 2023. However, you are free to switch to the new voice at anytime
- Removing ta-IN-Standard-A
- Redirecting to ta-IN-Standard-C
- Removing ta-IN-Wavenet-A
- Redirecting to ta-IN-Wavenet-C
- Removing ta-IN-Standard-B
- Redirecting to ta-IN-Standard-D
- Removing ta-IN-Wavenet-B
- Redirecting to ta-IN-Wavenet-D
- Removing pt-BR-Standard-A
- Redirecting to pt-BR-Standard-C
- Removing pt-BR-Wavenet-A
- Redirecting to pt-BR-Wavenet-C
- Removing ja-JP-Standard-A
- Redirecting to ja-JP-Standard-B
- Removing ja-JP-Wavenet-A
- Redirecting to ja-JP-Wavenet-B
September 01, 2022
Text-to-Speech now offers these new voices. See the supported voices page for a complete list of voices and audio samples.
- ta-IN-Wavenet-C
- ta-IN-Standard-C
- ta-IN-Wavenet-D
- ta-IN-Standard-D
August 05, 2022
Text-to-Speech now offers these new voices. See the supported voices page for a complete list of voices and audio samples.
- pt-BR-Standard-C
- pt-BR-Wavenet-C
June 27, 2022
Cloud Text-to-Speech now supports Neural2 voices in addition to Standard and WaveNet voice generation models. Neural2 uses Custom Voice technology without the need to train a unique voice. Neural2 voices are in Preview and are currently available in a single region for a limited number of languages.
April 09, 2021
Text-to-Speech now offers voices in the following new languages. See the supported voices page for a complete list of voices and audio samples.
- es-US (Spanish, US)
- af-ZA (Afrikaans, South Africa)
- bg-BG (Bulgarian, Bulgaria)
- ca-ES (Catalan, Spain)
- is-IS (Icelandic, Iceland)
- lv-LV (Latvian, Latvia)
- sr-RS (Serbian, Cyrillic)
January 22, 2021
New language: Text-to-Speech now supports Romanian (ro-RO). See the supported voices page for details and audio samples.
New voice: Text-to-Speech now offers 2 new Bengali (bn-IN) WaveNet voices. See the supported voices page for details and audio samples.
August 24, 2020
Text-to-Speech now offers four new English (US) voices, available as both WaveNet and Standard models. See the supported voices and languages page for more details.
May 01, 2020
Cloud Text-to-Speech now offers 36 new voices (both Standard and WaveNet) in the following languages. See the Supported Voices and Languages page for complete details.
- Arabic
- Bengali (India)
- English (India)
- French (France)
- German (Germany)
- Gujarati (India)
- Hindi (India)
- Indonesian (Indonesia)
- Kannada (India)
- Malayalam (India)
- Mandarin Chinese
- Russian (Russia)
- Tamil (India)
- Telugu (India)
- Thai (Thailand)
February 05, 2019
The audio profile feature is generally available for use in new applications. Cloud Text-to-Speech API now allows developers to specify an audio profile for the audio generated from Cloud Text-to-Speech API. Audio profiles are optimized for specific hardware used for playback, from headphones to car stereos.
Added new Standard and WaveNet voices in the following languages and variants:
- Danish (Denmark)
- Polish (Poland)
- Portuguese (Brazil)
- Russian (Russia)
- Slovak (Slovakia)
- Turkish (Turkey)
- Ukrainian (Ukraine)
Review the voices list for complete details.
July 24, 2018
Added new WaveNet voices in the following languages and variants:
- Dutch (Netherlands)
- English (Australia)
- English (UK)
- German
- Italian
- Japanese
Review the voices list for complete details.
March 20, 2018
The names of voices provided for speech synthesis in the Text-to-Speech API have changed. Previous versions of the voice names do not work. To see the list of voices provided in the Text-to-Speech API including the correct names, see the voice list.