Gemini 2.0 Flash

Gemini 2.0 Flash delivers next-generation features and improved capabilities designed for the agentic era, including superior speed, built-in tool use, multimodal generation, and a 1M token context window. Gemini 2.0 Flash improves upon our previous Flash model and offers enhanced quality at similar speeds.

Caution: As of June 1, 2026, `gemini-2.0-flash-001` and `gemini-2.0-flash-lite-001` are discontinued and no longer available. This applies to both model serving and Provisioned Throughput. Use Gemini 3.1 Flash-Lite, Gemma 4, or a more recent Gemini release instead.

Try in Agent Studio | View in Model Garden | Deploy example app

Note: "Deploy example app" requires a Google Cloud project with billing and the Agent Platform API enabled.

Model ID	`gemini-2.0-flash`
Modalities	Text	Input and output
Modalities	Image	Input only
Modalities	Audio	Input only
Modalities	Video	Input only
Token limits	Context window	1,048,576
Token limits	Maximum output tokens	8,192 (default)
Capabilities	Thinking	Not supported
Capabilities	System instructions	Supported
Capabilities	Gemini Live API	Not supported
Capabilities	Structured output	Supported
Capabilities	Context caching	Explicit context caching: Supported
Capabilities	Count Tokens	Supported
Capabilities	RAG Engine	Not supported
Capabilities	Chat completions	Supported
Capabilities	Tuning	Supervised fine-tuning, tuning checkpoints: Supported
Capabilities	URL context	Supported
Tools	Grounding (Google Search)	Supported
Tools	Code execution	Supported
Tools	Function calling	Supported
Tools	Computer Use (Preview feature)	Not supported
Consumption options	Provisioned Throughput	Supported
Consumption options	Batch inference	Supported
Consumption options	Pay-as-you-go	Not supported
Consumption options	Fixed quota	Not supported
Consumption options	See Consumption options for more information.
Input size limit	500 MB
Technical specifications	Image	Maximum images per prompt: 3,000
		Maximum file size per file for inline data or direct uploads through the console: 7 MB
		Maximum file size per file from Google Cloud Storage: 30 MB
		Maximum tokens per minute (TPM) per project, high/medium/default media resolution: US/Asia: 40 M; EU: 10 M
		Maximum tokens per minute (TPM) per project, low media resolution: US/Asia: 10 M; EU: 2.6 M
		Supported MIME types: `image/png`, `image/jpeg`, `image/webp`, `image/heic`, `image/heif`
	Text	Maximum number of files per prompt: 3,000
		Maximum number of pages per file: 1,000
		Maximum file size per file for the API or Cloud Storage imports: 50 MB (`application/pdf`) or 7 MB (`text/plain`)
		Maximum file size per file for direct uploads through the console: 7 MB
		Maximum tokens per minute (TPM) per project: US/Asia: 3.4 M; EU: 3.4 M
		Supported MIME types: `application/pdf`, `text/plain`
	Video	Maximum video length (with audio): Approximately 45 minutes
		Maximum video length (without audio): Approximately 1 hour
		Maximum number of videos per prompt: 10
		Maximum tokens per minute (TPM), high/medium/default media resolution: US/Asia: 38 M; EU: 10 M
		Maximum tokens per minute (TPM), low media resolution: US/Asia: 10 M; EU: 2.5 M
		Supported MIME types: `video/x-flv`, `video/quicktime`, `video/mpeg`, `video/mpegs`, `video/mpg`, `video/mp4`, `video/webm`, `video/wmv`, `video/3gpp`
	Audio	Maximum audio length per prompt: Approximately 8.4 hours, or up to 1 million tokens
		Maximum number of audio files per prompt: 1
		Speech understanding for: Audio summarization, transcription, and translation
		Maximum tokens per minute (TPM): US/Asia: 3.5 M; EU: 3.5 M
		Supported MIME types: `audio/x-aac`, `audio/flac`, `audio/mp3`, `audio/m4a`, `audio/mpeg`, `audio/mpga`, `audio/mp4`, `audio/ogg`, `audio/pcm`, `audio/wav`, `audio/webm`
	Parameter defaults	Temperature: 0.0–2.0 (default 1.0)
		topP: 0.0–1.0 (default 0.95)
		topK: 64 (fixed)
		candidateCount: 1–8 (default 1)
Supported regions	Model availability	Global: global
		United States: us-central1, us-east1, us-east4, us-east5, us-south1, us-west1, us-west4
		Europe: europe-central2, europe-north1, europe-southwest1, europe-west1, europe-west4, europe-west8, europe-west9
	ML processing
	See Deployments and endpoints for more information.
Knowledge cutoff date	June 2024
Versions	`gemini-2.0-flash-001`	Launch stage: Discontinued
		Release date: February 5, 2025
		Retirement date: June 1, 2026
Pricing	See Pricing.
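
The parameter ranges in the table can be checked on the client before a request is sent. The following is a minimal sketch rather than an SDK feature: `GenerationSettings` and `find_violations` are hypothetical names introduced for illustration, and treating 8,192 as the upper bound for output tokens is an assumption based on the "Maximum output tokens" row.

```python
from dataclasses import dataclass

# Values copied from the "Token limits" and "Parameter defaults" rows.
MAX_OUTPUT_TOKENS = 8_192
FIXED_TOP_K = 64


@dataclass
class GenerationSettings:
    """Hypothetical container for illustration; not an SDK type."""
    temperature: float = 1.0
    top_p: float = 0.95
    top_k: int = FIXED_TOP_K
    candidate_count: int = 1
    max_output_tokens: int = MAX_OUTPUT_TOKENS


def find_violations(settings: GenerationSettings) -> list[str]:
    """Return one message per setting that falls outside the documented range."""
    problems = []
    if not 0.0 <= settings.temperature <= 2.0:
        problems.append("temperature must be between 0.0 and 2.0")
    if not 0.0 <= settings.top_p <= 1.0:
        problems.append("top_p must be between 0.0 and 1.0")
    if settings.top_k != FIXED_TOP_K:
        problems.append("top_k is fixed at 64")
    if not 1 <= settings.candidate_count <= 8:
        problems.append("candidate_count must be between 1 and 8")
    # Assumption: 8,192 is treated as the upper bound for output tokens.
    if not 1 <= settings.max_output_tokens <= MAX_OUTPUT_TOKENS:
        problems.append("max_output_tokens must be between 1 and 8192")
    return problems
```

With the defaults, `find_violations(GenerationSettings())` returns an empty list. Returning a list of messages instead of raising on the first problem lets a caller report every out-of-range setting at once.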

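The MIME types and per-file size limits listed under Technical specifications can be expressed as a small lookup for pre-upload checks. This is a sketch under stated assumptions: `modality_for` and `max_file_bytes` are hypothetical helpers, 1 MB is taken as 1,000,000 bytes because the table does not define the unit, and audio and video return no size limit because the table gives durations rather than per-file sizes for them.

```python
from typing import Optional

MB = 1_000_000  # Assumption: decimal megabytes; the table does not say.

# MIME types copied from the "Supported MIME types" entries in the table.
SUPPORTED_MIME_TYPES = {
    "image": {"image/png", "image/jpeg", "image/webp", "image/heic", "image/heif"},
    "text": {"application/pdf", "text/plain"},
    "video": {
        "video/x-flv", "video/quicktime", "video/mpeg", "video/mpegs",
        "video/mpg", "video/mp4", "video/webm", "video/wmv", "video/3gpp",
    },
    "audio": {
        "audio/x-aac", "audio/flac", "audio/mp3", "audio/m4a", "audio/mpeg",
        "audio/mpga", "audio/mp4", "audio/ogg", "audio/pcm", "audio/wav",
        "audio/webm",
    },
}


def modality_for(mime_type: str) -> Optional[str]:
    """Return the table row a MIME type belongs to, or None if unlisted."""
    normalized = mime_type.strip().lower()
    for modality, types in SUPPORTED_MIME_TYPES.items():
        if normalized in types:
            return modality
    return None


def max_file_bytes(mime_type: str, from_cloud_storage: bool = False) -> Optional[int]:
    """Return the documented per-file size limit in bytes, or None if not stated."""
    normalized = mime_type.strip().lower()
    modality = modality_for(normalized)
    if modality is None:
        raise ValueError(f"unsupported MIME type: {mime_type}")
    if modality == "image":
        # 7 MB inline or via console upload, 30 MB from Cloud Storage.
        return (30 if from_cloud_storage else 7) * MB
    if modality == "text":
        # Limits for the API or Cloud Storage imports; console uploads are 7 MB.
        return (50 if normalized == "application/pdf" else 7) * MB
    return None  # Audio and video: no per-file size limit appears in the table.
```

For example, `max_file_bytes("image/jpeg")` yields the 7 MB inline limit, and passing `from_cloud_storage=True` yields the 30 MB limit. The overall 500 MB input size limit is not checked here and would need a separate check across all parts of a request.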