Cloud TTS release notes

This page documents production updates to Cloud TTS. You can periodically check this page for announcements about new or updated features, bug fixes, known issues, and deprecated functionality.

You can see the latest product updates for all of Google Cloud on the Google Cloud page, browse and filter all release notes in the Google Cloud console, or programmatically access release notes in BigQuery.

To get the latest product updates delivered to you, add the URL of this page to your feed reader, or add the feed URL directly.

December 30, 2025

Feature

Chirp 3: HD voices now support speech synthesis in Preview for the Chinese (Hong Kong) yue-HK language. For more information, see Chirp 3: Language Availability.

December 12, 2025

Feature

Chirp 3: HD voices now support speech synthesis in Preview for the Punjabi (pa-IN) language. For more information, see Chirp 3: Language Availability.

December 11, 2025

Feature

Our latest Gemini TTS models are now available in more regions.

gemini-2.5-pro-tts: global, us, eu
gemini-2.5-flash-tts: global, us, eu, and northamerica-northeast1
gemini-2.5-flash-lite-preview-tts: global, us, eu, and northamerica-northeast1

For more information on how to take advantage of the extended regional availability, see the Gemini TTS page.

November 10, 2025

Feature

Chirp 3: HD voices now support speech synthesis in the following languages: bg-BG, cs-CZ, et-EE, el-GR, he-IL, hr-HR, hu-HU, lt-LT, lv-LV, ro-RO, sk-SK, sl-SI, and sr-RS. For more information, see Chirp 3: Language Availability.

November 07, 2025

Change

Gemini TTS now supports relaxing safety filters for accounts with monthly invoiced billing. For more information, see the Gemini TTS page.

Feature

Gemini TTS now supports synthesis for streaming requests. For more information, see the Gemini TTS page.

October 21, 2025

Change

Chirp 3: instant custom voice now supports voice cloning key generation in the eu and us regions. For more information, see the Chirp 3: instant custom voice page.

October 17, 2025

Feature

Chirp 3 HD now supports speech synthesis using SSML input. Supported SSML tags are: <phoneme>, <p>, <s>, <sub>, and <say-as>. For more information, see Chirp 3 HD: SSML support.

September 30, 2025

Feature

We just released Gemini-2.5 TTS Flash (gemini-2.5-flash-tts) and Pro (gemini-2.5-pro-tts) in GA with support for 30 speakers in 80 or more locales. The latest models enable granular control over style, accent, pace, and emotion using natural language prompts.

This release supports single and multi-speaker synthesis for applications ranging from short commands to long-form narratives. For more information, see the Gemini TTS documentation page.

Announcement

Gemini-TTS is generally available (GA) and provides support for 30 voices and over 70 locales. You can synthesize single or multi-speaker speech from short snippets to long-form narratives. You can precisely dictate style, accent, pace, tone, and even emotional expression using natural-language prompts.

For more information, see Gemini TTS. Give it a try in Media Studio.

September 15, 2025

Change

Chirp 3: HD voices is available on the asia-northeast1 endpoint. For more information, see Chirp 3: HD voices.

August 27, 2025

Change

Chirp 3: instant custom voice supports the Chirp 3: HD voice controls for pace control, pause control, and custom pronunciations. For more information, see the Chirp 3: instant custom voice page.

Change

Chirp 3: HD voices is available on the europe-west2 endpoint. For more information, see Chirp 3: HD voices.

August 21, 2025

Change

Chirp 3: Instant custom voice supports new input audio encodings PCM, MP3, and M4A, with any sample rate. For more information, see the Chirp 3: Instant custom voice page.

July 24, 2025

Change

Chirp 3: HD voices now offers General Availability (GA) support for four additional Nordic languages: Danish (da-DK), Finnish (fi-FI), Norwegian Bokmål (nb-NO), and Swedish (sv-SE). For more information, see Chirp 3: HD voices.

June 18, 2025

Change

Chirp 3: Instant Custom Voice now extends support to ja-JP, now supporting more than 30 locales. For more information, check the Chirp 3: Instant Custom Voice documentation.

May 07, 2025

Change

We just released three new voice features for Chirp 3: HD Voices. Pace control is available across all locales; pause control is available across all locales; custom pronunciations is available across all locales except bn-in, gu-in, nl-be, sw-ke, th-th, uk-ua, ur-in, and vi-vn. Be sure to check our Chirp 3: HD Voices documentation for more information.

April 16, 2025

Change

The polyglot voices feature is only supported in multi-regions.

April 02, 2025

Announcement

Chirp 3: HD voices with 8 speakers and 31 locales is now GA. It offers real-time streaming and batch processing capabilities and is accessible in global, us, eu, and asia-southeast1 regions.

Explore the latest Chirp 3: HD voices capabilities. Find out their full potential by visiting our updated documentation, specifically the voice controls section.

March 27, 2025

Change

Chirp 3: HD voices in en-US now support experimental features for pace and pause controls.

March 17, 2025

Change

Chirp 3: HD voices are only available in the global, us, eu, and asia-southeast1 regions. To use these voices, switch your endpoint to a supported region.

March 06, 2025

Change

Chirp 3: HD voices now supports 8 new speakers in 31 new locales: ar-XA, bn-IN, cmn-CN, de-DE, en-AU, en-GB, en-IN, en-US, es-ES, es-US, fr-CA, fr-FR, gu-IN, hi-IN, id-ID, it-IT, ja-JP, kn-IN, ko-KR, ml-IN, mr-IN, nl-NL, pl-PL, pt-BR, ru-RU, sw-KE, ta-IN, te-IN, th-TH, tr-TR, and vi-VN.

February 10, 2025

Change

Journey voices have been rebranded as Chirp HD voices.

December 03, 2024

Change

Journey Voices now supports the Journey-O speaker for de-de, en-au, en-in, en-gb, es-es, es-us, fr-ca, fr-fr, and it-it.

November 22, 2024

Change

Cloud TTS Journey voices have been updated to improve the accuracy of generated speech. This means you should notice fewer instances of dropped words.

November 11, 2024

Change

Journey Voices now supports the de-de, en-gb, en-in, es-us, fr-ca, fr-fr, and it-it locales.

October 30, 2024

Change

Studio Voices now support synthesis with multiple speakers to generate audios for interviews, interactive storytelling, video games, e-learning platforms, and accessibility solutions.

October 18, 2024

Change

Journey Voices and streaming synthesis now support the de-de, en-gb, en-in, es-us, fr-ca, fr-fr, and it-it locales.

September 10, 2024

Announcement

Journey Voices is now in Preview and supports text streaming.

May 14, 2024

Change

Cloud Text-to-Speech now offers updated Journey voices with an additional speaker, en-us-Journey-O.

April 19, 2024

Change

Cloud Text-to-Speech now offers es-ES Studio voices: es-ES-Studio-C and es-ES-Studio-F

February 26, 2024

Announcement

Casual voices are now in preview.

Announcement

Studio voices are now GA.

December 29, 2023

Announcement

Journey voices are now in experimental.

November 29, 2023

Announcement

Cloud Text-to-Speech now offers de-DE and fr-FR Studio voices: de-DE-Studio-B, de-DE-Studio-C, fr-FR-Studio-A, and fr-FR-Studio-D.

November 06, 2023

Change

As of November 13 2023, speaker en-US-Studio-M will no longer be available. All requests sent to en-US-Studio-M will be routed to speaker en-US-Studio-Q. There is no action needed.

November 03, 2023

Announcement

Cloud Text-to-Speech now offers en-GB Studio voices: en-GB-Studio-B and en-GB-Studio-C.

October 25, 2023

Feature

Styles are now supported in Neural2 voices through SSML. The following styles are supported

<google:emotion name="apologetic">
<google:emotion name="calm">
<google:emotion name="empathetic">
<google:emotion name="firm">
<google:emotion name="lively">

for the following voices:

en-us-Neural2-F
en-us-Neural2-J

October 24, 2023

Announcement

Studio voices now support 5,000 bytes of either text or SSML input per synthesis request.

Feature

Long Audio Synthesis now supports SSML inputs.

Feature

Long Audio Synthesis now supports Studio voices.

October 16, 2023

Feature

The Long Audio Synthesis API now supports the following languages: English, Spanish, French, German, Japanese, Hindi, Italian, Korean, Portuguese, Thai, Vietnamese, Danish, Filipino.

Change

There is no longer billing differentiation for Cloud Text-to-Speech Offline Custom Voice API calls. See the <ReportedUsage> documentation for more details.

June 28, 2023

Change

Studio voices now support SSML, except for the following tags: <mark>, <emphasis>, <prosody>, and <lang>

March 16, 2023

Feature

Cloud Text-to-Speech now offers Long Audio Synthesis. This new API can be used to synthesize texts longer than 5 KB. For more information about API usage using the command line, see Create long audio from text by using the command line.

March 06, 2023

Announcement

Text-to-Speech now offers a Spanish Studio voice, es-US-Studio-B, in addition to its existing English Studio voices.

February 16, 2023

Feature

Text-to-Speech offers these new voices. See the supported voices page for a complete list of voices and audio samples.

eu-ES-Standard-A
gl-ES-Standard-A

February 08, 2023

Change

Text-to-Speech now offers Studio voices. This voice type is designed specifically for use with long-form texts such as narration and news reading. See the supported voices page for a complete list of voices and audio samples.

en-US-Studio-M
en-US-Studio-O

January 10, 2023

Announcement

On or after July 9th, 2023, Cloud Text-to-Speech will replace the following voices with new voices of similar quality and accent. The new voices are available to try now. No action will be needed from you to switch to the new voice on July 9th, 2023. However, you are free to switch to the new voice at any time.

Removing ml-IN-Standard-A
- Redirecting to ml-IN-Standard-C
Removing ml-IN-Wavenet-A
- Redirecting ml-IN-Wavenet-C
Removing ml-IN-Standard-B
- Redirecting to ml-IN-Standard-D
Removing ml-IN-Wavenet-B
- Redirecting ml-IN-Wavenet-D
Removing bn-IN-Standard-A
- Redirecting to bn-IN-Standard-C
Removing bn-IN-Wavenet-A
- Redirecting bn-IN-Wavenet-C
Removing bn-IN-Standard-B
- Redirecting to bn-IN-Standard-D
Removing bn-IN-Wavenet-B
- Redirecting bn-IN-Wavenet-D
Removing kn-IN-Standard-A
- Redirecting to kn-IN-Standard-C
Removing kn-IN-Wavenet-A
- Redirecting kn-IN-Wavenet-C
Removing kn-IN-Standard-B
- Redirecting to kn-IN-Standard-D
Removing kn-IN-Wavenet-B
- Redirecting kn-IN-Wavenet-D
Removing gu-IN-Standard-A
- Redirecting to gu-IN-Standard-C
Removing gu-IN-Wavenet-A
- Redirecting gu-IN-Wavenet-C
Removing gu-IN-Standard-B
- Redirecting to gu-IN-Standard-D
Removing gu-IN-Wavenet-B
- Redirecting gu-IN-Wavenet-D
Removing it-IT-Standard-A
- Redirecting to it-IT-Standard-B
Removing it-IT-Wavenet-A
- Redirecting to it-IT-Wavenet-B
Removing es-ES-Standard-A
- Redirecting to es-ES-Standard-C

December 22, 2022

Feature

Text-to-Speech now offers these new news reading voices. See the supported voices page for a complete list of voices and audio samples.

es-US-News-D
es-US-News-E
es-US-News-F
es-US-News-G
en-AU-News-E
en-AU-News-F
en-AU-News-G
en-GB-News-G
en-GB-News-H
en-GB-News-I
en-GB-News-J
en-GB-News-K
en-GB-News-L
en-GB-News-M

Feature

Text-to-Speech now offers these new voices. See the supported voices page for a complete list of voices and audio samples.

ml-IN-Wavenet-C
ml-IN-Wavenet-D

Note these voices are bilingual with en-IN.

November 29, 2022

Announcement

Text-to-Speech now offers additional Neural2 voices across 9 locales with 40+ speakers. Voices are available in the us-central1, us, and eu endpoints. See the supported voices page for a complete list of voices and audio samples.

November 10, 2022

Change

Text-to-Speech now offers these new voices. See the supported voices page for a complete list of voices and audio samples.

en-US-News-K
en-US-News-L
en-US-News-M
en-US-News-N

October 24, 2022

Change

Text-to-Speech improved the quality of these voices. See the supported voices page for a complete list of voices and audio samples.

en-GB-Wavenet-A
en-GB-Wavenet-B
en-GB-Wavenet-C
en-GB-Wavenet-D
en-GB-Wavenet-F
es-ES-Wavenet-B
es-ES-Wavenet-C
es-ES-Wavenet-D
hi-IN-Wavenet-A
hi-IN-Wavenet-B
hi-IN-Wavenet-C
hi-IN-Wavenet-D

October 07, 2022

Change

On or after April 8th, 2023, Cloud Text-to-Speech will replace the following voices with new voices of similar quality and accent. The new voices are available to try now. No action will be needed from you to switch to the new voice on April 8th, 2023. However, you are free to switch to the new voice at anytime

Removing ta-IN-Standard-A
1. Redirecting to ta-IN-Standard-C
Removing ta-IN-Wavenet-A
1. Redirecting to ta-IN-Wavenet-C
Removing ta-IN-Standard-B
1. Redirecting to ta-IN-Standard-D
Removing ta-IN-Wavenet-B
1. Redirecting to ta-IN-Wavenet-D
Removing pt-BR-Standard-A
1. Redirecting to pt-BR-Standard-C
Removing pt-BR-Wavenet-A
1. Redirecting to pt-BR-Wavenet-C
Removing ja-JP-Standard-A
1. Redirecting to ja-JP-Standard-B
Removing ja-JP-Wavenet-A
1. Redirecting to ja-JP-Wavenet-B

Change

Text-to-Speech now offers these new voices. See the supported voices page for a complete list of voices and audio samples.

mr-IN-Wavenet-A
mr-IN-Standard-A
mr-IN-Wavenet-B
mr-IN-Standard-B
mr-IN-Wavenet-C
mr-IN-Standard-C

September 01, 2022

Change

Text-to-Speech now offers these new voices. See the supported voices page for a complete list of voices and audio samples.

ta-IN-Wavenet-C
ta-IN-Standard-C
ta-IN-Wavenet-D
ta-IN-Standard-D

August 19, 2022

Change

Text-to-Speech has improved the quality of these voices

pt-br-Standard-A
pt-br-Standard-B

August 05, 2022

Feature

Text-to-Speech now offers these new voices. See the supported voices page for a complete list of voices and audio samples.

pt-BR-Standard-C
pt-BR-Wavenet-C

June 27, 2022

Announcement

Cloud Text-to-Speech now supports Neural2 voices in addition to Standard and WaveNet voice generation models. Neural2 uses Custom Voice technology without the need to train a unique voice. Neural2 voices are in Preview and are currently available in a single region for a limited number of languages.

March 09, 2022

Feature

Text-to-Speech now offers regional endpoints for the following places. See the How-to guides for more information on how to use these endpoints. - Europe - https://eu-texttospeech.googleapis.com - US - https://us-texttospeech.googleapis.com

June 17, 2021

Feature

Text-to-Speech now offers voices in the following new languages. See the supported voices page for a complete list of voices and audio samples.

ms-MY (Malay, Malaysia)
nl-BE (Dutch, Belgium)

April 09, 2021

Feature

Text-to-Speech now offers voices in the following new languages. See the supported voices page for a complete list of voices and audio samples.

es-US (Spanish, US)
af-ZA (Afrikaans, South Africa)
bg-BG (Bulgarian, Bulgaria)
ca-ES (Catalan, Spain)
is-IS (Icelandic, Iceland)
lv-LV (Latvian, Latvia)
sr-RS (Serbian, Cyrillic)

April 07, 2021

Feature

Text-to-Speech now supports MULAW and ALAW audio encodings. See the AudioEncoding reference documentation for details.

March 01, 2021

Feature

Text-to-Speech has launched Beta support of new SSML tags: <phoneme>, <mark>, <lang>, <voice>, and <say-as interpret-as="duration"> to specify durations. See the phonemes for a list of phonemes available for your language.

Fixed

Text-to-Speech has improved the continuity of mixed-media results. Now when you mix text and sounds within a <s>/<s> block, Text-to-Speech generates a much shorter pause and better transition between the synthesized speech and the sound.

Fixed

Support for the <prosody> SSML tag has been enhanced to produce continuous TTS when possible.

Text-to-speech has resolved an issue that affected how volume changes are calculated, resulting in different but correct behavior.
Text-to-speech has resolved an issue that affected how pitch changes are calculated, resulting in different but correct behavior.

Announcement

Text-to-Speech has improved its handling of speech synthesis requests sent using SSML markup.

Fixed

Text-to-Speech has improved the verbalization and pacing of phone numbers.

January 22, 2021

Feature

New language: Text-to-Speech now supports Romanian (ro-RO). See the supported voices page for details and audio samples.

Feature

New voice: Text-to-Speech now offers 2 new Bengali (bn-IN) WaveNet voices. See the supported voices page for details and audio samples.

August 24, 2020

Feature

Text-to-Speech now offers four new Chinese (Hong Kong) voices, available as Standard models. See the supported voices and languages page for more details.

Feature

Text-to-Speech now offers four new English (US) voices, available as both WaveNet and Standard models. See the supported voices and languages page for more details.

May 01, 2020

Feature

Cloud Text-to-Speech now offers 36 new voices (both Standard and WaveNet) in the following languages. See the Supported Voices and Languages page for complete details.

Arabic
Bengali (India)
English (India)
French (France)
German (Germany)
Gujarati (India)
Hindi (India)
Indonesian (Indonesia)
Kannada (India)
Malayalam (India)
Mandarin Chinese
Russian (Russia)
Tamil (India)
Telugu (India)
Thai (Thailand)

August 27, 2019

Feature

Cloud Text-to-Speech now offers 76 new voices, both standard and WaveNet, in the following languages:

Arabic
Czech
Dutch
English (India)
Filipino
Finnish
Greek
Hindi
Hungarian
Indonesian
Italian
Japanese
Mandarin Chinese
Norwegian
Vietnamese

February 05, 2019

Feature

The audio profile feature is generally available for use in new applications. Cloud Text-to-Speech API now allows developers to specify an audio profile for the audio generated from Cloud Text-to-Speech API. Audio profiles are optimized for specific hardware used for playback, from headphones to car stereos.

Feature

Added new Standard and WaveNet voices in the following languages and variants:

Danish (Denmark)
Polish (Poland)
Portuguese (Brazil)
Russian (Russia)
Slovak (Slovakia)
Turkish (Turkey)
Ukrainian (Ukraine)

Review the voices list for complete details.

August 28, 2018

Feature

Cloud Text-to-Speech API general availability (GA) release.

This release includes the public availability of the v1 API endpoint, both in REST and RPC.

July 24, 2018

Feature

Added new WaveNet voices in the following languages and variants:

Dutch (Netherlands)
English (Australia)
English (UK)
German
Italian
Japanese

Review the voices list for complete details.

Feature

Cloud Text-to-Speech API now allows developers to specify an audio profile for the audio generated from Cloud Text-to-Speech API. Audio profiles are optimized for specific hardware used for playback, from headphones to car stereos.

June 01, 2018

Feature

Added Korean (ko-KR) voice. Review the voices list for complete details.

March 27, 2018

Feature

Cloud Text-to-Speech API is now available in beta.

March 20, 2018

Change

The names of voices provided for speech synthesis in the Text-to-Speech API have changed. Previous versions of the voice names do not work. To see the list of voices provided in the Text-to-Speech API including the correct names, see the voice list.

March 02, 2018

Change

The gender field changed to ssmlGender.

February 02, 2018

Change

Add voices in the following languages:

English (US)
French (Canada)
Dutch
Portuguese (Brazil)
Swedish
Turkish

See the voices list for complete details.

November 10, 2017

Feature

Cloud Text-to-Speech API Alpha release.

Cloud TTS release notes Stay organized with collections Save and categorize content based on your preferences.

December 30, 2025

December 12, 2025

December 11, 2025

November 10, 2025

November 07, 2025

October 21, 2025

October 17, 2025

September 30, 2025

September 15, 2025

August 27, 2025

August 21, 2025

July 24, 2025

June 18, 2025

May 07, 2025

April 16, 2025

April 02, 2025

March 27, 2025

March 17, 2025

March 06, 2025

February 10, 2025

December 03, 2024

November 22, 2024

November 11, 2024

October 30, 2024

October 18, 2024

September 10, 2024

May 14, 2024

April 19, 2024

February 26, 2024

December 29, 2023

November 29, 2023

November 06, 2023

November 03, 2023

October 25, 2023

October 24, 2023

October 16, 2023

June 28, 2023

March 16, 2023

March 06, 2023

February 16, 2023

February 08, 2023

January 10, 2023

December 22, 2022

November 29, 2022

November 10, 2022

October 24, 2022

October 07, 2022

September 01, 2022

August 19, 2022

August 05, 2022

June 27, 2022

March 09, 2022

June 17, 2021

April 09, 2021

April 07, 2021

March 01, 2021

January 22, 2021

August 24, 2020

May 01, 2020

August 27, 2019

February 05, 2019

August 28, 2018

July 24, 2018

June 01, 2018

March 27, 2018

March 20, 2018

March 02, 2018

February 02, 2018

November 10, 2017

Cloud TTS release notes