Google uses AI technology to translate content into your preferred language. AI translations can contain errors.

Passer d'un mode à l'autre

Les modes de déploiement sont une configuration au niveau du projet. Le passage d'un mode à l'autre ne déplace ni ne supprime vos données de l'autre mode. Vous pouvez utiliser l'API UpdateRagEngineConfig pour basculer entre les modes de déploiement sans serveur et Spanner. Vous pouvez également utiliser cette API pour définir le niveau de votre mode de déploiement Spanner ou pour annuler le provisionnement de votre mode Spanner afin d'arrêter la facturation. Vous pouvez utiliser l'API GetRagEngineConfig pour lire les informations de votre mode de déploiement actuel.

Passer en mode sans serveur

Les exemples de code suivants montrent comment passer votre RagEngineConfig en mode sans serveur :

Console

Dans la Google Cloud console, accédez à la page Moteur RAG.
Accéder au moteur RAG
Sélectionnez la région dans laquelle votre moteur RAG Vertex AI s'exécute.
Cliquez sur l'option Passer en mode sans serveur. Elle n'est peut-être pas visible si vous êtes en mode sans serveur. Vous pouvez vérifier votre mode actuel à partir du libellé du mode en haut à droite de la page.

REST

PROJECT_ID: Your project ID.
LOCATION: The region to process the request.

curl -X PATCH \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragEngineConfig -d "{'ragManagedDbConfig': {'serverless': {}}}"

Python

from vertexai.preview import rag
import vertexai

PROJECT_ID = YOUR_PROJECT_ID
LOCATION = YOUR_RAG_ENGINE_LOCATION

# Initialize Vertex AI API once per session
vertexai.init(project=PROJECT_ID, location=LOCATION)

rag_engine_config_name=f"projects/{PROJECT_ID}/locations/{LOCATION}/ragEngineConfig"

new_rag_engine_config = rag.RagEngineConfig(
    name=rag_engine_config_name,
    rag_managed_db_config=rag.RagManagedDbConfig(mode=rag.Serverless()),
)

updated_rag_engine_config = rag.rag_data.update_rag_engine_config(
    rag_engine_config=new_rag_engine_config
)

print(updated_rag_engine_config)

Passer en mode Spanner

Les exemples de code suivants montrent comment passer votre RagEngineConfig en mode Spanner. Si vous avez déjà utilisé le mode Spanner et que vous avez choisi un niveau, vous n'avez pas besoin de le fournir explicitement lors du changement. Sinon, reportez-vous aux exemples de code ci-dessous pour savoir comment passer en mode Spanner tout en fournissant un niveau.

Console

Dans la Google Cloud console, accédez à la page Moteur RAG.
Accéder au moteur RAG
Sélectionnez la région dans laquelle votre moteur RAG Vertex AI s'exécute.
Cliquez sur l'option Passer en mode Spanner. Elle n'est peut-être pas visible si vous êtes en mode Spanner. Vous pouvez vérifier votre mode actuel à partir du libellé du mode.

REST

PROJECT_ID: Your project ID.
LOCATION: The region to process the request.

curl -X PATCH \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragEngineConfig -d "{'ragManagedDbConfig': {'spanner': {}}}"

Python

from vertexai.preview import rag
import vertexai

PROJECT_ID = YOUR_PROJECT_ID
LOCATION = YOUR_RAG_ENGINE_LOCATION

# Initialize Vertex AI API once per session
vertexai.init(project=PROJECT_ID, location=LOCATION)

rag_engine_config_name=f"projects/{PROJECT_ID}/locations/{LOCATION}/ragEngineConfig"

new_rag_engine_config = rag.RagEngineConfig(
    name=rag_engine_config_name,
    rag_managed_db_config=rag.RagManagedDbConfig(mode=rag.Spanner()),
)

updated_rag_engine_config = rag.rag_data.update_rag_engine_config(
    rag_engine_config=new_rag_engine_config
)

print(updated_rag_engine_config)

Lire votre RagEngineConfig actuelle

Les exemples de code suivants montrent comment lire votre RagEngineConfig pour voir le mode et le niveau choisis :

REST

PROJECT_ID: Your project ID.
LOCATION: The region to process the request.

curl -X GET \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragEngineConfig

Python

from vertexai.preview import rag
import vertexai

PROJECT_ID = YOUR_PROJECT_ID
LOCATION = YOUR_RAG_ENGINE_LOCATION

# Initialize Vertex AI API once per session
vertexai.init(project=PROJECT_ID, location=LOCATION)

rag_engine_config = rag.rag_data.get_rag_engine_config(
    name=f"projects/{PROJECT_ID}/locations/{LOCATION}/ragEngineConfig"
)

print(rag_engine_config)

Modifier le niveau en mode Spanner

Les exemples de code suivants montrent comment modifier le niveau en mode Spanner :

Modifier votre RagEngineConfig pour passer au niveau Scaled en mode Spanner

Les exemples de code suivants montrent comment définir le RagEngineConfig sur le mode Spanner avec le niveau Scaled :

Console

Dans la Google Cloud console, accédez à la page Moteur RAG.
Accéder au moteur RAG
Sélectionnez la région dans laquelle votre moteur RAG Vertex AI s'exécute.
Cliquez sur l'option Passer en mode Spanner si vous n'êtes pas déjà en mode Spanner.
Cliquez sur Configurer le moteur RAG. Le volet Configurer le moteur RAG s'affiche.
Sélectionnez le niveau avec lequel vous souhaitez exécuter votre moteur RAG.
Cliquez sur Enregistrer.

REST

PROJECT_ID: Your project ID.
LOCATION: The region to process the request.

curl -X PATCH \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragEngineConfig -d "{'ragManagedDbConfig': {'spanner': {'scaled': {}}}}"

Python

from vertexai.preview import rag
import vertexai

PROJECT_ID = YOUR_PROJECT_ID
LOCATION = YOUR_RAG_ENGINE_LOCATION

# Initialize Vertex AI API once per session
vertexai.init(project=PROJECT_ID, location=LOCATION)

rag_engine_config_name=f"projects/{PROJECT_ID}/locations/{LOCATION}/ragEngineConfig"

new_rag_engine_config = rag.RagEngineConfig(
    name=rag_engine_config_name,
    rag_managed_db_config=rag.RagManagedDbConfig(mode=rag.Spanner(tier=rag.Scaled())),
)

updated_rag_engine_config = rag.rag_data.update_rag_engine_config(
    rag_engine_config=new_rag_engine_config
)

print(updated_rag_engine_config)

Modifier votre RagEngineConfig pour passer au niveau Basic en mode Spanner

Les exemples de code suivants montrent comment définir le RagEngineConfig sur le mode Spanner avec le niveau Basic :

Console

Dans la Google Cloud console, accédez à la page Moteur RAG.
Accéder au moteur RAG
Sélectionnez la région dans laquelle votre moteur RAG Vertex AI s'exécute.
Cliquez sur l'option Passer en mode Spanner si vous n'êtes pas déjà en mode Spanner.
Cliquez sur Configurer le moteur RAG. Le volet Configurer le moteur RAG s'affiche.
Sélectionnez le niveau avec lequel vous souhaitez exécuter votre moteur RAG.
Cliquez sur Enregistrer.

REST

PROJECT_ID: Your project ID.
LOCATION: The region to process the request.

curl -X PATCH \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragEngineConfig -d "{'ragManagedDbConfig': {'spanner': {'basic': {}}}}"

Python

from vertexai.preview import rag
import vertexai

PROJECT_ID = YOUR_PROJECT_ID
LOCATION = YOUR_RAG_ENGINE_LOCATION

# Initialize Vertex AI API once per session
vertexai.init(project=PROJECT_ID, location=LOCATION)

rag_engine_config_name=f"projects/{PROJECT_ID}/locations/{LOCATION}/ragEngineConfig"

new_rag_engine_config = rag.RagEngineConfig(
    name=rag_engine_config_name,
    rag_managed_db_config=rag.RagManagedDbConfig(mode=rag.Spanner(tier=rag.Basic())),
)

updated_rag_engine_config = rag.rag_data.update_rag_engine_config(
    rag_engine_config=new_rag_engine_config
)

print(updated_rag_engine_config)

Modifier votre RagEngineConfig pour passer au niveau Non provisionné

Les exemples de code suivants montrent comment définir le RagEngineConfig sur le mode Spanner avec le niveau Non provisionné. Toutes les données de votre mode de déploiement Spanner seront alors supprimées définitivement, et les frais de facturation associés seront interrompus.

Console

Dans la Google Cloud console, accédez à la page Moteur RAG.
Accéder au moteur RAG
Sélectionnez la région dans laquelle votre moteur RAG Vertex AI s'exécute.
Cliquez sur l'option Passer en mode Spanner si vous n'êtes pas déjà en mode Spanner.
Cliquez sur Supprimer le moteur RAG. Une boîte de dialogue de confirmation s'affiche.
Vérifiez que vous êtes sur le point de supprimer vos données dans le moteur RAG de Vertex AI en saisissant delete.
Cliquez sur Confirmer.
Cliquez sur Enregistrer.

REST

PROJECT_ID: Your project ID.
LOCATION: The region to process the request.

curl -X PATCH \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragEngineConfig -d "{'ragManagedDbConfig': {'spanner': {'unprovisioned': {}}}}"

Python

from vertexai.preview import rag
import vertexai

PROJECT_ID = YOUR_PROJECT_ID
LOCATION = YOUR_RAG_ENGINE_LOCATION

# Initialize Vertex AI API once per session
vertexai.init(project=PROJECT_ID, location=LOCATION)

rag_engine_config_name=f"projects/{PROJECT_ID}/locations/{LOCATION}/ragEngineConfig"

new_rag_engine_config = rag.RagEngineConfig(
    name=rag_engine_config_name,
    rag_managed_db_config=rag.RagManagedDbConfig(mode=rag.Spanner(tier=rag.Unprovisioned())),
)

updated_rag_engine_config = rag.rag_data.update_rag_engine_config(
    rag_engine_config=new_rag_engine_config
)

print(updated_rag_engine_config)

Passer d'un mode à l'autre Restez organisé à l'aide des collections Enregistrez et classez les contenus selon vos préférences.

Passer en mode sans serveur

Console

REST

Python

Passer en mode Spanner

Console

REST

Python

Lire votre RagEngineConfig actuelle

REST

Python

Modifier le niveau en mode Spanner

Modifier votre RagEngineConfig pour passer au niveau Scaled en mode Spanner

Console

REST

Python

Modifier votre RagEngineConfig pour passer au niveau Basic en mode Spanner

Console

REST

Python

Modifier votre RagEngineConfig pour passer au niveau Non provisionné

Console

REST

Python

Passer d'un mode à l'autre