Supported AI models

To build generative AI applications with AlloyDB for PostgreSQL, you can register remote model endpoints using Model endpoint management. This lets you use SQL to interact with Large Language Models (LLMs) for tasks such as generating text embeddings, multimodal content, and semantic ranking.

This page provides a list of AI models that have built-in support or are pre-registered for use with AlloyDB. These AI models simplify your integration with Vertex AI, OpenAI, and Anthropic. For more information about different model types, see Model type.

Models with built-in support

AlloyDB provides built-in support for several models from AI providers. For these models, AlloyDB handles the underlying integration details automatically, such as setting up default transform functions. This occurs when you register the model endpoint using its qualified name.

This section lists the qualified model names and types for the Vertex AI, OpenAI, and Anthropic models.

Vertex AI

Model type Model name
text-embedding text-embedding-004
text-embedding text-embedding-005
text-embedding text-embedding-preview-0815
text-embedding text-multilingual-embedding-002
text-embedding text-embedding-large-exp-03-07 *
multimodal_embedding multimodalembedding@001
reranking semantic-ranker-default-003
  • The text-embedding-large-exp-03-07 model is available only in the us-central1 region.

OpenAI

Model type Qualified model name
text-embedding text-embedding-ada-002
text-embedding text-embedding-3-small
text-embedding text-embedding-3-large

Anthropic

Model type Qualified model name
generic claude-3-opus-20240229
generic claude-3-sonnet-20240229
generic claude-3-haiku-20240307

Pre-registered Vertex AI models

For the following Vertex AI models, AlloyDB provides pre-registration, which lets you call these models directly using their qualified names without requiring a manual registration step. This offers a streamlined way to use Vertex AI capabilities within your SQL queries.

The following table lists the model types and IDs of pre-registered Vertex AI models.

Model type Model ID
text-embeddingtext-embedding-005
text-embeddinggemini-embedding-001
text-embeddingtext-multilingual-embedding-002
multimodal-embeddingmultimodalembedding
multimodal-embeddingmultimodalembedding@001
rerankingsemantic-ranker-512
rerankingsemantic-ranker-default-002
rerankingsemantic-ranker-default-003
rerankingsemantic-ranker-default-004
rerankingsemantic-ranker-fast-004
llmgemini-2.0-flash
llmgemini-2.5-flash
llmgemini-2.0-flash-lite
llmgemini-2.5-flash-lite
llmgemini-2.5-pro