Send feedback
Gemini 2.5 Flash-Lite
Stay organized with collections
Save and categorize content based on your preferences.
Gemini 2.5 Flash-Lite is our most balanced Gemini model,
optimized for low latency use cases. It comes with the same capabilities that
make other Gemini 2.5 models helpful, such as the ability to turn
thinking on at different budgets, connecting to tools like
Grounding with Google Search and code execution, multimodal input, and
a 1 million-token context length.
2.5 Flash-Lite
Try in Agent Platform (Preview) Deploy example app
Note: To use the "Deploy example app" feature, you need a Google Cloud project with billing and Agent Platform API enabled.
Model ID
gemini-2.5-flash-lite
Supported inputs & outputs
Inputs:
Text ,
Code ,
Images ,
Audio ,
Video
Outputs:
Token limits
Maximum input tokens: 1,048,576
Maximum output tokens: 65,535 (default)
Capabilities
Consumption options
See Consumption options for more information.
Input size limit
500 MB
Technical specifications
Images
photo
Documents
description
Video
videocam
Audio
mic
Parameter defaults tune
Supported regions
Model availability
United States
us-central1
us-east1
us-east4
us-east5
us-south1
us-west1
us-west4
Europe
europe-central2
europe-north1
europe-southwest1
europe-west1
europe-west4
europe-west8
europe-west9
See Deployments and endpoints for more information.
Knowledge cutoff date
January 2025
Versions
gemini-2.5-flash-lite
Launch stage: GA
Release date: July 22, 2025
Discontinuation date: Not before October 16, 2026
Security controls
Online prediction
Data residency
CMEK
VPC-SC
AXT
Batch prediction
Data residency
CMEK
VPC-SC
AXT
Tuning
Data residency
CMEK
VPC-SC
AXT
Context caching
Data residency
CMEK
VPC-SC
AXT
RAG Engine
Data residency
CMEK
VPC-SC
AXT
Grounding with Google Search and Grounding with Google Maps
Data residency
CMEK
VPC-SC
AXT
See Security controls for more information.
Supported languages
See Supported languages .
Pricing
See Pricing .
2.5 Flash-Lite
Preview
This feature is
subject to the "Pre-GA Offerings Terms" in the General Service Terms section of the
Service Specific
Terms .
Pre-GA features are available "as is" and might have limited support.
For more information, see the
launch stage descriptions .
Caution: gemini-2.5-flash-lite-preview-09-2025 will be discontinued on July 9, 2026. Update your application to use gemini-2.5-flash-lite or other supported model.
Try in Agent Platform (Preview) Deploy example app
Note: To use the "Deploy example app" feature, you need a Google Cloud project with billing and Agent Platform API enabled.
Model ID
gemini-2.5-flash-lite-preview-09-2025
Supported inputs & outputs
Inputs:
Text ,
Code ,
Images ,
Audio ,
Video
Outputs:
Token limits
Maximum input tokens: 1,048,576
Maximum output tokens: 65,535 (default)
Capabilities
Consumption options
See Consumption options for more information.
Technical specifications
Images
photo
Documents
description
Video
videocam
Audio
mic
Parameter defaults tune
Supported regions
Model availability
See Deployments and endpoints for more information.
Knowledge cutoff date
January 2025
Versions
gemini-2.5-flash-lite-preview-09-2025
Launch stage: Public preview
Release date: September 25, 2025
Discontinuation date: July 9, 2026
Supported languages
See Supported languages .
Pricing
See Pricing .
Send feedback
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License , and code samples are licensed under the Apache 2.0 License . For details, see the Google Developers Site Policies . Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2026-05-26 UTC.
Need to tell us more?
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2026-05-26 UTC."],[],[]]