Overview of Model Garden

Model Garden is an AI/ML model library that helps you discover, test, customize, and deploy models and assets from Google and Google partners.

Advantages of Model Garden

When you're working with AI models, Model Garden provides the following advantages:

  • Available models are all grouped in a single location
  • Model Garden provides a consistent deployment pattern for different types of models
  • Model Garden provides built-in integration with other parts of Vertex AI such as model tuning, evaluation, and serving
  • Serving generative AI models can be difficult—Vertex AI handles model deployment and serving for you

Explore models

To view the list of available Vertex AI and open source foundation, tunable, and task-specific models, go to the Model Garden page in the Google Cloud console.

Go to Model Garden

The model categories available in Model Garden are:

Category Description
Foundation models Pretrained multitask large models that can be tuned or customized for specific tasks using Vertex AI Studio, Vertex AI API, and the Vertex AI SDK for Python.
Fine-tunable models Models that you can fine-tune using a custom notebook or pipeline.
Task-specific solutions Most of these prebuilt models are ready to use. Many can be customized using your own data.

To filter models in the filter pane, specify the following:

  • Tasks: Click the task that you want the model to perform.
  • Model collections: Click to choose models that are managed by Google, partners, or you.
  • Providers: Click the provider of the model.
  • Features: Click the features that you want in the model.

To learn more about each model, click its model card.

For a list of models available in Model Garden, see Models available in Model Garden.

Model security scanning

Google does thorough testing and benchmarking on the serving and tuning containers that we provide. Active vulnerability scanning is also applied to container artifacts.

Third-party models from featured partners undergo model checkpoint scans to ensure authenticity. Third-party models from HuggingFace Hub are scanned directly by HuggingFace and their third-party scanner for malware, pickle files, Keras Lambda layers, and secrets. Models deemed unsafe from these scans are flagged by HuggingFace and blocked from deployment in Model Garden. Models deemed suspicious or those that have the ability to potentially execute remote code are indicated in Model Garden but can still be deployed. We recommend you perform a thorough review of any suspicious model before deploying it within Model Garden.

Pricing

For the open source models in Model Garden, you are charged for use of following on Vertex AI:

Control access to specific models

You can set a Model Garden organization policy at the organization, folder, or project level to control access to specific models in Model Garden. For example, you can allow access to specific models that you've vetted and deny access to all others.

Learn more about Model Garden

For more information about the deployment options and customizations that you can do with models in Model Garden, view the resources in the following sections, which include links to tutorials, references, notebooks, and YouTube videos.

Deploy and serve

Learn more about customizing deployments and advance serving features.

Container compliance

Model Garden offers the following FedRAMP high compliant containers for model serving.

Container name Supported tasks Container image version Notebook example
PyTorch Inference v0.4 audio2text
text2image
zero-shot-image-classification
zero-shot-object-detection
csm_text2speech
dia_text2speech
image-to-text
visual-question-answering
instant-id
janus_text2image
janus_text_generation
mask-generation
nllb_translation
paligemma_v2
pix2pix
us-docker.pkg.dev/deeplearning-platform-release/vertex-model-garden/pytorch-inference.cu125.0-4.ubuntu2204.py310:model-garden.pytorch-inference-0-4-gpu-release_20250923.01_p0 HiDream-I1
SGLang Text2text generation us-docker.pkg.dev/deeplearning-platform-release/vertex-model-garden/sglang-serve.cu124.0-4.ubuntu2204.py310:model-garden.sglang-0-4-release_20250914.00_p0 Qwen3 (Deployment)
HuggingFace Inference Toolkit text2image generation
vanilla text-generation
text-classification
translation
zero-shot-object-detection
mask-generation
sentence embeddings
feature extraction
fill mask

Full task list: https://huggingface.co/docs/inference-endpoints/en/supported_tasks
us-docker.pkg.dev/deeplearning-platform-release/vertex-model-garden/hf-inference-toolkit.cu125.0-1.ubuntu2204.py311:model-garden.hf-inference-toolkit-0-1-release_20251017.00_p0 Hugging Face PyTorch Inference Deployment
HuggingFace Text Embeddings Inference (TEI) text2embeddings us-docker.pkg.dev/deeplearning-platform-release/vertex-model-garden/hf-tei.cu125.0-1.ubuntu2204.py310:model-garden.hf-tei-0-1-release_20251017.00_p0 Hugging Face Text Embeddings Inference Deployment

Tuning

Learn more about tuning models to tailor responses for specific use cases.

Evaluation

Learn more about assessing model responses with Vertex AI

Additional resources