Serverless mode is the recommended way to get started with Vertex AI RAG Engine. It provides a fully managed and highly scalable database to back your Vertex AI RAG Engine resources, without the hassle of provisioning or managing any infrastructure. It offers quick onboarding and is suitable for most users, including those operating at enterprise scale.
Serverless mode is a preview feature and is available in us-central1 only. It is available to everyone and does not require any allowlist.
With Serverless mode, your Vertex AI RAG Engine critical user journey (CUJ) remains identical to that of Spanner mode, except for the following key differences:
- RagManagedDb is not available as a choice of vector database when using Serverless mode. Instead, the new default vector database is the fully managed and highly scalable RagManagedVertexVectorSearch. To learn more about this vector database, refer to Use Vector Search 2.0 with RAG.
- Customer-Managed Encryption Keys (CMEK) are not supported in Serverless mode.
Switching to Serverless mode
To switch to Serverless mode as your choice of deployment mode, refer to the Switching your modes page.
Delete your data
To delete your data from Serverless deployment mode, follow these steps:
- Ensure your active mode is set to Serverless.
- Call the ListRagCorpora API to view your resources under this mode.
- Delete each corpus using the DeleteRagCorpus API.
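The list-then-delete steps above can be sketched in Python as follows. This is a minimal illustration, not an official procedure: the helper name delete_all_corpora and its two callable parameters are hypothetical, chosen so the logic can be exercised without credentials. It assumes only that each listed corpus exposes a name attribute holding its resource name.

```python
from typing import Callable, Iterable, List


def delete_all_corpora(
    list_corpora: Callable[[], Iterable],
    delete_corpus: Callable[[str], None],
) -> List[str]:
    """Delete every corpus returned by list_corpora; return the deleted names.

    Both callables are stand-ins (assumptions) for whatever client you use
    to reach the ListRagCorpora and DeleteRagCorpus APIs.
    """
    # Collect all resource names first so that deleting entries does not
    # interfere with iteration over a paged listing.
    names = [corpus.name for corpus in list_corpora()]
    for name in names:
        delete_corpus(name)
    return names
```

If you use the Vertex AI SDK for Python, one plausible wiring (assuming the SDK is installed, initialized for us-central1, and that Serverless is your active mode) is to pass rag.list_corpora as the first argument and lambda name: rag.delete_corpus(name=name) as the second, where rag is imported from the vertexai package. Check the SDK reference for your installed version before relying on these calls.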
Pricing
There is no additional charge for using Serverless deployment mode. However, if you have an active Spanner mode deployment, you might still be charged for it. Additionally, you might incur charges from your choice of models, reranking, and vector storage on Vertex AI RAG Engine. For more information, refer to the Vertex AI RAG Engine billing page.