Gemini Embedding 2

Gemini Embedding 2 is Google's embedding generation model that's ideal for complex retrieval and analytics tasks.

Gemini Embedding 2 accepts multimodal inputs to generate 3072-dimensional vectors. It accepts images, text, documents, audio, and video inputs and semantically maps the generated vectors into a unified semantic space. This lets you perform tasks, such as searching for an image based on a text description.

Gemini Embedding 2 introduces several features to optimize embedding quality and flexibility:

  • Custom task instructions: By specifying task instructions—for example, task:code retrieval or task:search result—optimize the embeddings for the intended relationships and retrieve more accurate results for the specific goal.

  • Adjustable result size: The model generates a 3072-dimensional float vector, by default. However, you can retrieve a smaller dimensional output by specifying the output_dimensionality parameter.

  • Document OCR: Read OCR from document inputs.

  • Audio track extraction: Extract audio tracks from video inputs and interleave them with video frames.

Try in Vertex AI (Preview) Deploy example app

Note: To use the "Deploy example app" feature, you need a Google Cloud project with billing and Vertex AI API enabled.
Model ID gemini-embedding-2-preview
Supported inputs & outputs
  • Inputs:
    Text, Images, Audio, Video, PDF
  • Outputs:
    Embeddings
Token limits
  • Maximum input tokens: 8,192
  • Maximum output tokens: N/A
Maximum sequence length

8,192 tokens

Output dimensions

Up to 3,072 (with MRL support)

Consumption options
See Consumption options for more information.
Technical specifications
Images
  • Maximum images per prompt: 6
  • Maximum file size per file for inline data or direct uploads through the console: No limit
  • Maximum file size per file from Google Cloud Storage: No limit
  • Maximum number of output images per prompt: N/A
  • Supported MIME types:
    image/png, image/jpeg
Documents
  • Maximum number of files per prompt: 1
  • Maximum number of pages per file: 6
  • Maximum file size per file: N/A
  • Supported MIME types:
    application/pdf
Video
  • Maximum video length (with audio): 80 seconds
  • Maximum video length (without audio): 120 seconds
  • Maximum number of videos per prompt: 1
  • Supported MIME types:
    video/mpeg, video/mp4
Audio
  • Maximum audio length per prompt: 80 seconds
  • Maximum number of audio files per prompt: 1
  • Supported MIME types:
    audio/mp3, audio/wav
Parameter defaults
Supported regions

Model availability

  • United States
    • us-central1
See Deployments and endpoints for more information.
Knowledge cutoff date November 2025
Versions
  • gemini-embedding-2-preview
    • Launch stage: Public preview
    • Release date: March 10, 2026
Supported languages See Supported languages.
Pricing See Pricing.