Google models

Google models on Gemini Enterprise Agent Platform offer fully managed and serverless models as APIs. To use a Google model on Agent Platform, send a request directly to the Agent Platform API endpoint. Because Google models use a managed API, there's no need to provision or manage infrastructure.

You can stream your responses to reduce the end-user latency perception. A streamed response uses server-sent events (SSE) to incrementally stream the response.

Gemma 4 26B A4B IT

Gemma 4 26B A4B IT is a multimodal model from Google handling text and image input and generating text output.

Go to the Gemma 4 26B A4B IT model card

Use Google models

For managed models, you can use curl commands to send requests to the Gemini Enterprise Agent Platform endpoint using the following model names:

  • For Gemma 4 26B A4B IT, use gemma-4-26b-a4b-it-maas

To learn how to make streaming and non-streaming calls to Google models, see Call open model APIs.

To use a self-deployed Gemini Enterprise Agent Platform model:

  1. Navigate to the Model Garden console.
  2. Find the relevant Gemini Enterprise Agent Platform model.
  3. Click Enable and complete the provided form to get the necessary commercial use licenses.

For more information about deploying and using partner models, see Deploy a partner model and make prediction requests .

Google model region availability

Google models are available in the following regions:

Model Regions
Gemma 4 26B A4B IT
  • global
    • Max output: 128,000
    • Context length: 262,144

What's next

Learn how to Call open model APIs.