Data residency

Data stored at rest in the customer selected location remains at rest in that location, independent of the Generative AI on Vertex AI endpoint called by that customer's request.

ML processing

Machine learning (ML) processing for Generative AI on Vertex AI services occurs within the specific region or multi-region where the request is made.

For any regional endpoint not explicitly listed in the following tables, such as those in the Middle East, there is no guarantee that ML processing occurs at a specific location. These endpoints support older models that don't offer ML processing guarantees.

ML processing for Google Cloud models

United States

Model US multi-region
Gemini 2.5 Flash Image(gemini-2.5-flash-image)
Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite)
Gemini 2.5 Pro(gemini-2.5-pro)
Gemini 2.5 Flash(gemini-2.5-flash)
Gemini 2.0 Flash(gemini-2.0-flash)
Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite)
Tuning for Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite)
Tuning for Gemini 2.5 Pro(gemini-2.5-pro)
Tuning for Gemini 2.5 Flash(gemini-2.5-flash)
Tuning for Gemini 2.0 Flash(gemini-2.0-flash-001)
Tuning for Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001)
Gemini Embeddings(gemini-embedding-001)
Embeddings for Text(text-embedding-004)
Embeddings for Text(text-embedding-005)
Embeddings for Text(text-multilingual-embedding-002)
Embeddings for Multimodal
Imagen 2(imagegeneration@005)
Chirp 3: Transcription(chirp_3)
Chirp 3: HD Voices
Chirp 3: Instant Custom Voice

Canada

Model Montréal(northamerica-northeast1)
Gemini 2.5 Flash Image (gemini-2.5-flash-image)
Gemini 2.5 Flash-Lite (gemini-2.5-flash-lite)
Gemini 2.5 Pro (gemini-2.5-pro)
Gemini 2.5 Flash , 1M (gemini-2.5-flash)
Gemini 2.0 Flash (gemini-2.0-flash)
Gemini 2.0 Flash-Lite (gemini-2.0-flash-lite)
Gemini Embeddings(gemini-embedding-001)
Embeddings for Text(text-embedding-004)
Embeddings for Text(text-embedding-005)
Embeddings for Text(text-multilingual-embedding-002)
Embeddings for Multimodal
Imagen 2(imagegeneration@005)
Chirp 3: Transcription(chirp_3)
Chirp 2: Transcription(chirp_2)
Chirp 3: HD Voices
Chirp 3: Instant Custom Voice

Europe

Model EU multi-region Paris(europe-west9) London(europe-west2) Frankfurt(europe-west3) Netherlands(europe-west4)
Gemini 2.5 Flash Image(gemini-2.5-flash-image)
Gemini 2.5 Flash, 1M(gemini-2.5-flash)
Gemini 2.5 Flash, 128k(gemini-2.5-flash)
Tuning for Gemini 2.5 Flash(gemini-2.5-flash)
Gemini 2.5 Pro(gemini-2.5-pro)
Tuning for Gemini 2.5 Pro(gemini-2.5-pro)
Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite)
Tuning for Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite)
Gemini 2.0 Flash(gemini-2.0-flash-001)
Tuning for Gemini 2.0 Flash(gemini-2.0-flash-001)
Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001)
Tuning for Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001)
Gemini Embeddings(gemini-embedding-001)
Embeddings for Text(text-embedding-004)
Embeddings for Text(text-embedding-005)
Embeddings for Text(text-multilingual-embedding-002)
Embeddings for Multimodal
Imagen 2(imagegeneration@005)
Chirp 3: Transcription(chirp_3)
Chirp 2: Transcription(chirp_2)
Gemini 2.5 Flash TTS(gemini-2.5-flash-tts)
Gemini 2.5 Flash TTS(gemini-2.5-pro-tts)
Chirp 3: HD Voices
Chirp 3: Instant Custom Voice

Asia Pacific

Model Tokyo(asia-northeast1) Sydney(australia-southeast1) Mumbai(asia-south1) Singapore(asia-southeast1) Seoul(asia-northeast3)
Gemini 2.5 Flash Image(gemini-2.5-flash-image)
Gemini 2.5 Flash, 1M(gemini-2.5-flash)
Gemini 2.5 Flash, 128k(gemini-2.5-flash)
Gemini 2.5 Pro(gemini-2.5-pro)
Gemini 2.5 Pro, 64k(gemini-2.5-pro)
Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite)
Gemini 2.0 Flash(gemini-2.0-flash-001)
Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001)
Gemini Embeddings(gemini-embedding-001)
Embeddings for Text(text-embedding-004)
Embeddings for Text(text-embedding-005)
Embeddings for Text(text-multilingual-embedding-002)
Embeddings for Multimodal
Imagen 2(imagegeneration@005)
Chirp 3: Transcription(chirp_3)
Chirp 2: Transcription(chirp_2)
Gemini 2.5 Flash TTS(gemini-2.5-flash-tts)
Gemini 2.5 Flash TTS(gemini-2.5-pro-tts)
Chirp 3: HD Voices
Chirp 3: Instant Custom Voice

ML processing for Google Cloud partner models

United States

Model US multi-region
Anthropic's Claude Sonnet 4.5
Anthropic's Claude Opus 4.1
Anthropic's Claude Haiku 4.5
Anthropic's Claude Opus 4
Anthropic's Claude Sonnet 4
Anthropic's Claude 3.7 Sonnet
Anthropic's Claude 3.5 Haiku
Anthropic's Claude 3 Haiku
Mistral Medium 3
Mistral OCR (25.05)
Mistral Small 3.1 (25.03)
Mistral Large (24.07)
Codestral 2
Codestral (24.05)

Europe

Model EU multi-region Belgium(europe-west1) Netherlands(europe-west4)
Anthropic's Claude Sonnet 4.5
Anthropic's Claude Opus 4.1
Anthropic's Claude Haiku 4.5
Anthropic's Claude Opus 4
Anthropic's Claude Sonnet 4
Anthropic's Claude 3.7 Sonnet
Anthropic's Claude 3.5 Haiku
Anthropic's Claude 3 Haiku
Mistral Medium 3
Mistral OCR (25.05)
Mistral Small 3.1 (25.03)
Mistral Large (24.07)
Codestral 2
Codestral (24.05)

Asia Pacific

Model Singapore(asia-southeast1) Taiwan(asia-east1)
Anthropic's Claude Sonnet 4.5
Anthropic's Claude Opus 4.1
Anthropic's Claude Haiku 4.5
Anthropic's Claude Opus 4
Anthropic's Claude Sonnet 4
Anthropic's Claude 3.7 Sonnet
Anthropic's Claude 3.5 Haiku
Anthropic's Claude 3 Haiku
Anthropic's Claude 3 Opus
Mistral Medium 3
Mistral OCR (25.05)
Mistral Small 3.1 (25.03)
Mistral Large (24.07)
Codestral 2
Codestral (24.05)

ML processing for Google Cloud open models

United States

Model US multi-region
DeepSeek R1 (0528)
DeepSeek-OCR
DeepSeek-V3.1
gpt-oss 120B
gpt-oss 20B
Llama 3.1 8B (Preview)
Llama 3.1 70B (Preview)
Llama 3.1 405B
Llama 3.2 90B (Preview)
Llama 3.3 70B (Preview)
Llama 4 Maverick 17B-128E (Preview)
Llama 4 Scout 17B-16E (Preview)
MiniMax M2
Multilingual E5 Large
Multilingual E5 Small
Qwen3 235B
Qwen3 Coder
Qwen3-Next-80B Instruct
Qwen3-Next-80B Thinking

Europe

Model EU multi-region Belgium(europe-west1) Netherlands(europe-west4)
DeepSeek R1 (0528)
DeepSeek-OCR
DeepSeek-V3.1
gpt-oss 120B
gpt-oss 20B
Llama 3.1 8B (Preview)
Llama 3.1 70B (Preview)
Llama 3.1 405B
Llama 3.2 90B (Preview)
Llama 3.3 70B (Preview)
Llama 4 Maverick 17B-128E (Preview)
Llama 4 Scout 17B-16E (Preview)
MiniMax M2
Multilingual E5 Large
Multilingual E5 Small
Qwen3 235B
Qwen3 Coder
Qwen3-Next-80B Instruct
Qwen3-Next-80B Thinking

Asia Pacific

Model Singapore(asia-southeast1) Taiwan(asia-east1)
DeepSeek R1 (0528)
DeepSeek-OCR
DeepSeek-V3.1
gpt-oss 120B
gpt-oss 20B
Llama 3.1 8B (Preview)
Llama 3.1 70B (Preview)
Llama 3.1 405B
Llama 3.2 90B (Preview)
Llama 3.3 70B (Preview)
Llama 4 Maverick 17B-128E (Preview)
Llama 4 Scout 17B-16E (Preview)
MiniMax M2
Multilingual E5 Large
Multilingual E5 Small
Qwen3 235B
Qwen3 Coder
Qwen3-Next-80B Instruct
Qwen3-Next-80B Thinking

What's next