Cloud TTS release notes

This page documents production updates to Cloud TTS. You can periodically check this page for announcements about new or updated features, bug fixes, known issues, and deprecated functionality.

You can see the latest product updates for all of Google Cloud on the Google Cloud page, browse and filter all release notes in the Google Cloud console, or programmatically access release notes in BigQuery.

To get the latest product updates delivered to you, add the URL of this page to your feed reader, or add the feed URL directly.

November 10, 2025

Feature

Chirp 3: HD voices now support speech synthesis in the following languages: bg-BG, cs-CZ, et-EE, el-GR, he-IL, hr-HR, hu-HU, lt-LT, lv-LV, ro-RO, sk-SK, sl-SI, and sr-RS. For more information, see Chirp 3: Language Availability.

November 07, 2025

Feature

Gemini TTS now supports synthesis for streaming requests. For more information, see the Gemini TTS page.

Change

Gemini TTS now supports relaxing safety filters for accounts with monthly invoiced billing. For more information, see the Gemini TTS page.

October 21, 2025

Change

Chirp 3: instant custom voice now supports voice cloning key generation in the eu and us regions. For more information, see the Chirp 3: instant custom voice page.

October 17, 2025

Feature

Chirp 3 HD now supports speech synthesis using SSML input. Supported SSML tags are: <phoneme>, <p>, <s>, <sub>, and <say-as>. For more information, see Chirp 3 HD: SSML support.

June 18, 2025

Change

Chirp 3: Instant Custom Voice now extends support to ja-JP, now supporting more than 30 locales. For more information, check the Chirp 3: Instant Custom Voice documentation.

May 07, 2025

Change

We just released three new voice features for Chirp 3: HD Voices. Pace control is available across all locales; pause control is available across all locales; custom pronunciations is available across all locales except bn-in, gu-in, nl-be, sw-ke, th-th, uk-ua, ur-in, and vi-vn. Be sure to check our Chirp 3: HD Voices documentation for more information.

April 16, 2025

Change

The polyglot voices feature is only supported in multi-regions.

April 02, 2025

Announcement

Chirp 3: HD voices with 8 speakers and 31 locales is now GA. It offers real-time streaming and batch processing capabilities and is accessible in global, us, eu, and asia-southeast1 regions.

Explore the latest Chirp 3: HD voices capabilities. Find out their full potential by visiting our updated documentation, specifically the voice controls section.

March 27, 2025

Change

Chirp 3: HD voices in en-US now support experimental features for pace and pause controls.

March 17, 2025

Change

Chirp 3: HD voices are only available in the global, us, eu, and asia-southeast1 regions. To use these voices, switch your endpoint to a supported region.

March 06, 2025

Change

Chirp 3: HD voices now supports 8 new speakers in 31 new locales: ar-XA, bn-IN, cmn-CN, de-DE, en-AU, en-GB, en-IN, en-US, es-ES, es-US, fr-CA, fr-FR, gu-IN, hi-IN, id-ID, it-IT, ja-JP, kn-IN, ko-KR, ml-IN, mr-IN, nl-NL, pl-PL, pt-BR, ru-RU, sw-KE, ta-IN, te-IN, th-TH, tr-TR, and vi-VN.

February 10, 2025

Change

Journey voices have been rebranded as Chirp HD voices.

December 03, 2024

Change

Journey Voices now supports the Journey-O speaker for de-de, en-au, en-in, en-gb, es-es, es-us, fr-ca, fr-fr, and it-it.

November 22, 2024

Change

Cloud TTS Journey voices have been updated to improve the accuracy of generated speech. This means you should notice fewer instances of dropped words.

November 11, 2024

Change

Journey Voices now supports the de-de, en-gb, en-in, es-us, fr-ca, fr-fr, and it-it locales.

October 30, 2024

Change

Studio Voices now support synthesis with multiple speakers to generate audios for interviews, interactive storytelling, video games, e-learning platforms, and accessibility solutions.

October 18, 2024

Change

Journey Voices and streaming synthesis now support the de-de, en-gb, en-in, es-us, fr-ca, fr-fr, and it-it locales.

May 14, 2024

Change

Cloud Text-to-Speech now offers updated Journey voices with an additional speaker, en-us-Journey-O.

April 19, 2024

Change

Cloud Text-to-Speech now offers es-ES Studio voices: es-ES-Studio-C and es-ES-Studio-F

December 29, 2023

Announcement

Journey voices are now in experimental.

November 29, 2023

Announcement

Cloud Text-to-Speech now offers de-DE and fr-FR Studio voices: de-DE-Studio-B, de-DE-Studio-C, fr-FR-Studio-A, and fr-FR-Studio-D.

November 06, 2023

Change

As of November 13 2023, speaker en-US-Studio-M will no longer be available. All requests sent to en-US-Studio-M will be routed to speaker en-US-Studio-Q. There is no action needed.

November 03, 2023

Announcement

Cloud Text-to-Speech now offers en-GB Studio voices: en-GB-Studio-B and en-GB-Studio-C.

October 25, 2023

Feature

Styles are now supported in Neural2 voices through SSML. The following styles are supported

  • <google:emotion name="apologetic">
  • <google:emotion name="calm">
  • <google:emotion name="empathetic">
  • <google:emotion name="firm">
  • <google:emotion name="lively">
for the following voices:
  • en-us-Neural2-F
  • en-us-Neural2-J

October 24, 2023

Announcement

Studio voices now support 5,000 bytes of either text or SSML input per synthesis request.

Feature
Feature

Long Audio Synthesis now supports SSML inputs.

October 16, 2023

Feature

The Long Audio Synthesis API now supports the following languages: English, Spanish, French, German, Japanese, Hindi, Italian, Korean, Portuguese, Thai, Vietnamese, Danish, Filipino.

Change

There is no longer billing differentiation for Cloud Text-to-Speech Offline Custom Voice API calls. See the <ReportedUsage> documentation for more details.

June 28, 2023

Change

Studio voices now support SSML, except for the following tags: <mark>, <emphasis>, <prosody>, and <lang>

September 01, 2022

Change

Text-to-Speech now offers these new voices. See the supported voices page for a complete list of voices and audio samples.

  1. ta-IN-Wavenet-C
  2. ta-IN-Standard-C
  3. ta-IN-Wavenet-D
  4. ta-IN-Standard-D

August 19, 2022

Change

Text-to-Speech has improved the quality of these voices

  1. pt-br-Standard-A
  2. pt-br-Standard-B

August 05, 2022

Feature

Text-to-Speech now offers these new voices. See the supported voices page for a complete list of voices and audio samples.

  1. pt-BR-Standard-C
  2. pt-BR-Wavenet-C

June 27, 2022

Announcement

Cloud Text-to-Speech now supports Neural2 voices in addition to Standard and WaveNet voice generation models. Neural2 uses Custom Voice technology without the need to train a unique voice. Neural2 voices are in Preview and are currently available in a single region for a limited number of languages.

June 17, 2021

Feature

Text-to-Speech now offers voices in the following new languages. See the supported voices page for a complete list of voices and audio samples.

  • ms-MY (Malay, Malaysia)
  • nl-BE (Dutch, Belgium)

April 09, 2021

Feature

Text-to-Speech now offers voices in the following new languages. See the supported voices page for a complete list of voices and audio samples.

  • es-US (Spanish, US)
  • af-ZA (Afrikaans, South Africa)
  • bg-BG (Bulgarian, Bulgaria)
  • ca-ES (Catalan, Spain)
  • is-IS (Icelandic, Iceland)
  • lv-LV (Latvian, Latvia)
  • sr-RS (Serbian, Cyrillic)

April 07, 2021

Feature

Text-to-Speech now supports MULAW and ALAW audio encodings. See the AudioEncoding reference documentation for details.

March 01, 2021

Feature

Text-to-Speech has launched Beta support of new SSML tags: <phoneme>, <mark>, <lang>, <voice>, and <say-as interpret-as="duration"> to specify durations. See the phonemes for a list of phonemes available for your language.

Fixed

Support for the <prosody> SSML tag has been enhanced to produce continuous TTS when possible.

  • Text-to-speech has resolved an issue that affected how volume changes are calculated, resulting in different but correct behavior.
  • Text-to-speech has resolved an issue that affected how pitch changes are calculated, resulting in different but correct behavior.
Fixed

Text-to-Speech has improved the continuity of mixed-media results. Now when you mix text and sounds within a <s>/<s> block, Text-to-Speech generates a much shorter pause and better transition between the synthesized speech and the sound.

Fixed

Text-to-Speech has improved the verbalization and pacing of phone numbers.

Announcement

Text-to-Speech has improved its handling of speech synthesis requests sent using SSML markup.

January 22, 2021

Feature

New language: Text-to-Speech now supports Romanian (ro-RO). See the supported voices page for details and audio samples.

Feature

New voice: Text-to-Speech now offers 2 new Bengali (bn-IN) WaveNet voices. See the supported voices page for details and audio samples.

August 24, 2020

Feature

Text-to-Speech now offers four new English (US) voices, available as both WaveNet and Standard models. See the supported voices and languages page for more details.

Feature

Text-to-Speech now offers four new Chinese (Hong Kong) voices, available as Standard models. See the supported voices and languages page for more details.

February 05, 2019

Feature

The audio profile feature is generally available for use in new applications. Cloud Text-to-Speech API now allows developers to specify an audio profile for the audio generated from Cloud Text-to-Speech API. Audio profiles are optimized for specific hardware used for playback, from headphones to car stereos.

Feature

Added new Standard and WaveNet voices in the following languages and variants:

  • Danish (Denmark)
  • Polish (Poland)
  • Portuguese (Brazil)
  • Russian (Russia)
  • Slovak (Slovakia)
  • Turkish (Turkey)
  • Ukrainian (Ukraine)

Review the voices list for complete details.

March 27, 2018

Feature

Cloud Text-to-Speech API is now available in beta.

March 20, 2018

Change

The names of voices provided for speech synthesis in the Text-to-Speech API have changed. Previous versions of the voice names do not work. To see the list of voices provided in the Text-to-Speech API including the correct names, see the voice list.

March 02, 2018

Change

The gender field changed to ssmlGender.

February 02, 2018

Change

Add voices in the following languages:

  • English (US)
  • French (Canada)
  • Dutch
  • Portuguese (Brazil)
  • Swedish
  • Turkish

See the voices list for complete details.