Index
- DataFoundryService(interface)
- EvaluationManagementService(interface)
- EvaluationService(interface)
- ExtensionExecutionService(interface)
- ExtensionRegistryService(interface)
- GenAiCacheConfigService(interface)
- GenAiCacheService(interface)
- GenAiTuningService(interface)
- LlmBidiService(interface)
- LlmUtilityService(interface)
- MemoryBankService(interface)
- PredictionService(interface)
- ReasoningEngineExecutionService(interface)
- ReasoningEngineService(interface)
- SessionService(interface)
- VertexRagDataService(interface)
- VertexRagService(interface)
- A2aGetReasoningEngineRequest(message)
- A2aPostReasoningEngineRequest(message)
- A2aStreamGetReasoningEngineRequest(message)
- A2aStreamPostReasoningEngineRequest(message)
- AggregationOutput(message)
- AggregationResult(message)
- ApiAuth(message)
- ApiAuth.ApiKeyConfig(message)
- AppendEventRequest(message)
- AppendEventResponse(message)
- AugmentPromptRequest(message)
- AugmentPromptRequest.Model(message)
- AugmentPromptResponse(message)
- AuthConfig(message)
- AuthConfig.ApiKeyConfig(message)
- AuthConfig.GoogleServiceAccountConfig(message)
- AuthConfig.HttpBasicAuthConfig(message)
- AuthConfig.OauthConfig(message)
- AuthConfig.OidcConfig(message)
- AuthType(enum)
- AutoraterConfig(message)
- BidiGenerateContentClientContent(message)
- BidiGenerateContentClientMessage(message)
- BidiGenerateContentRealtimeInput(message)
- BidiGenerateContentRealtimeInput.ActivityEnd(message)
- BidiGenerateContentRealtimeInput.ActivityStart(message)
- BidiGenerateContentServerContent(message)
- BidiGenerateContentServerContent.Transcription(message)
- BidiGenerateContentServerContent.TurnCompleteReason(enum)
- BidiGenerateContentServerMessage(message)
- BidiGenerateContentSetup(message)
- BidiGenerateContentSetup.AudioTranscriptionConfig(message)
- BidiGenerateContentSetupComplete(message)
- BidiGenerateContentToolCall(message)
- BidiGenerateContentToolCallCancellation(message)
- BidiGenerateContentToolResponse(message)
- BidiQueryReasoningEngineRequest(message)
- BidiQueryReasoningEngineResponse(message)
- BidiQueryReasoningEngineSetup(message)
- BigQueryDestination(message)
- BigQueryRequestSet(message)
- BigQueryRequestSet.SamplingConfig(message)
- BigQueryRequestSet.SamplingConfig.SamplingMethod(enum)
- BigQuerySource(message)
- BleuInput(message)
- BleuInstance(message)
- BleuMetricValue(message)
- BleuResults(message)
- BleuSpec(message)
- Blob(message)
- CacheConfig(message)
- CachedContent(message)
- CachedContent.UsageMetadata(message)
- CancelEvaluationRunRequest(message)
- CancelTuningJobRequest(message)
- Candidate(message)
- Candidate.FinishReason(enum)
- CandidateResponse(message)
- CandidateResult(message)
- ChatCompletionsRequest(message)
- Citation(message)
- CitationMetadata(message)
- Claim(message)
- CodeExecutionResult(message)
- CodeExecutionResult.Outcome(enum)
- CoherenceInput(message)
- CoherenceInstance(message)
- CoherenceResult(message)
- CoherenceSpec(message)
- CometInput(message)
- CometInstance(message)
- CometResult(message)
- CometSpec(message)
- CometSpec.CometVersion(enum)
- ComputeTokensRequest(message)
- ComputeTokensResponse(message)
- Content(message)
- ContentMap(message)
- ContentMap.Contents(message)
- ContextWindowCompressionConfig(message)
- ContextWindowCompressionConfig.SlidingWindow(message)
- CorpusStatus(message)
- CorpusStatus.State(enum)
- CorroborateContentRequest(message)
- CorroborateContentRequest.Parameters(message)
- CorroborateContentResponse(message)
- CountTokensRequest(message)
- CountTokensResponse(message)
- CreateCachedContentRequest(message)
- CreateEvaluationItemRequest(message)
- CreateEvaluationRunRequest(message)
- CreateEvaluationSetRequest(message)
- CreateMemoryOperationMetadata(message)
- CreateMemoryRequest(message)
- CreateRagCorpusOperationMetadata(message)
- CreateRagCorpusRequest(message)
- CreateReasoningEngineOperationMetadata(message)
- CreateReasoningEngineRequest(message)
- CreateSessionOperationMetadata(message)
- CreateSessionRequest(message)
- CreateTuningJobRequest(message)
- CustomOutput(message)
- CustomOutputFormatConfig(message)
- DatasetDistribution(message)
- DatasetDistribution.DistributionBucket(message)
- DatasetStats(message)
- DeleteCachedContentRequest(message)
- DeleteEvaluationItemRequest(message)
- DeleteEvaluationRunOperationMetadata(message)
- DeleteEvaluationRunRequest(message)
- DeleteEvaluationSetRequest(message)
- DeleteExtensionRequest(message)
- DeleteMemoryOperationMetadata(message)
- DeleteMemoryRequest(message)
- DeleteOperationMetadata(message)
- DeleteRagCorpusRequest(message)
- DeleteRagFileRequest(message)
- DeleteReasoningEngineRequest(message)
- DeleteSessionRequest(message)
- DirectUploadSource(message)
- DistillationDataStats(message)
- DistillationHyperParameters(message)
- DistillationSpec(message)
- DnsPeeringConfig(message)
- DynamicRetrievalConfig(message)
- DynamicRetrievalConfig.Mode(enum)
- EmbedContentRequest(message)
- EmbedContentRequest.EmbeddingTaskType(enum)
- EmbedContentResponse(message)
- EmbedContentResponse.Embedding(message)
- EncryptionSpec(message)
- EnterpriseWebSearch(message)
- EnvVar(message)
- EvaluateDatasetOperationMetadata(message)
- EvaluateDatasetRequest(message)
- EvaluateDatasetResponse(message)
- EvaluateDatasetRun(message)
- EvaluateInstancesRequest(message)
- EvaluateInstancesResponse(message)
- EvaluationConfig(message)
- EvaluationDataset(message)
- EvaluationInstance(message)
- EvaluationInstance.AgentConfig(message)
- EvaluationInstance.AgentConfig.Tools(message)
- EvaluationInstance.AgentData(message)
- EvaluationInstance.AgentData.Events(message)
- EvaluationInstance.AgentData.Tools(message)
- EvaluationInstance.InstanceData(message)
- EvaluationInstance.InstanceData.Contents(message)
- EvaluationInstance.MapInstance(message)
- EvaluationItem(message)
- EvaluationItem.EvaluationItemType(enum)
- EvaluationPrompt(message)
- EvaluationPrompt.PromptTemplateData(message)
- EvaluationRequest(message)
- EvaluationResult(message)
- EvaluationResults(message)
- EvaluationRubricConfig(message)
- EvaluationRun(message)
- EvaluationRun.DataSource(message)
- EvaluationRun.EvaluationConfig(message)
- EvaluationRun.EvaluationConfig.AutoraterConfig(message)
- EvaluationRun.EvaluationConfig.OutputConfig(message)
- EvaluationRun.EvaluationConfig.PromptTemplate(message)
- EvaluationRun.InferenceConfig(message)
- EvaluationRun.InferenceConfig.AgentConfig(message)
- EvaluationRun.State(enum)
- EvaluationRunMetric(message)
- EvaluationRunMetric.LLMBasedMetricSpec(message)
- EvaluationRunMetric.PredefinedMetricSpec(message)
- EvaluationRunMetric.RubricBasedMetricSpec(message)
- EvaluationRunMetric.RubricBasedMetricSpec.RepeatedRubrics(message)
- EvaluationRunMetric.RubricGenerationSpec(message)
- EvaluationRunMetric.RubricGenerationSpec.RubricContentType(enum)
- EvaluationSet(message)
- EventActions(message)
- EventMetadata(message)
- ExactMatchInput(message)
- ExactMatchInstance(message)
- ExactMatchMetricValue(message)
- ExactMatchResults(message)
- ExactMatchSpec(message)
- ExecutableCode(message)
- ExecutableCode.Language(enum)
- ExecuteExtensionRequest(message)
- ExecuteExtensionResponse(message)
- Extension(message)
- ExtensionManifest(message)
- ExtensionManifest.ApiSpec(message)
- ExtensionOperation(message)
- ExtensionPrivateServiceConnectConfig(message)
- ExternalApi(message)
- ExternalApi.ApiSpec(enum)
- ExternalApi.ElasticSearchParams(message)
- ExternalApi.SimpleSearchParams(message)
- Fact(message)
- FetchPredictOperationRequest(message)
- FileData(message)
- FileStatus(message)
- FileStatus.State(enum)
- FluencyInput(message)
- FluencyInstance(message)
- FluencyResult(message)
- FluencySpec(message)
- FulfillmentInput(message)
- FulfillmentInstance(message)
- FulfillmentResult(message)
- FulfillmentSpec(message)
- FullFineTuningSpec(message)
- FunctionCall(message)
- FunctionCallingConfig(message)
- FunctionCallingConfig.Mode(enum)
- FunctionDeclaration(message)
- FunctionResponse(message)
- FunctionResponseBlob(message)
- FunctionResponseFileData(message)
- FunctionResponsePart(message)
- GcsDestination(message)
- GcsSource(message)
- GeminiPreferenceExample(message)
- GeminiPreferenceExample.Completion(message)
- GenerateContentRequest(message)
- GenerateContentResponse(message)
- GenerateContentResponse.PromptFeedback(message)
- GenerateContentResponse.PromptFeedback.BlockedReason(enum)
- GenerateContentResponse.UsageMetadata(message)
- GenerateContentResponse.UsageMetadata.TrafficType(enum)
- GenerateInstanceRubricsRequest(message)
- GenerateInstanceRubricsResponse(message)
- GenerateMemoriesOperationMetadata(message)
- GenerateMemoriesRequest(message)
- GenerateMemoriesRequest.DirectContentsSource(message)
- GenerateMemoriesRequest.DirectContentsSource.Event(message)
- GenerateMemoriesRequest.DirectMemoriesSource(message)
- GenerateMemoriesRequest.DirectMemoriesSource.DirectMemory(message)
- GenerateMemoriesRequest.VertexSessionSource(message)
- GenerateMemoriesResponse(message)
- GenerateMemoriesResponse.GeneratedMemory(message)
- GenerateMemoriesResponse.GeneratedMemory.Action(enum)
- GenerateSyntheticDataRequest(message)
- GenerateSyntheticDataResponse(message)
- GenerationConfig(message)
- GenerationConfig.MediaResolution(enum)
- GenerationConfig.Modality(enum)
- GenerationConfig.ModelConfig(message)
- GenerationConfig.ModelConfig.FeatureSelectionPreference(enum)
- GenerationConfig.RoutingConfig(message)
- GenerationConfig.RoutingConfig.AutoRoutingMode(message)
- GenerationConfig.RoutingConfig.AutoRoutingMode.ModelRoutingPreference(enum)
- GenerationConfig.RoutingConfig.ManualRoutingMode(message)
- GenerationConfig.ThinkingConfig(message)
- GenericOperationMetadata(message)
- GetCacheConfigRequest(message)
- GetCachedContentRequest(message)
- GetEvaluationItemRequest(message)
- GetEvaluationRunRequest(message)
- GetEvaluationSetRequest(message)
- GetExtensionRequest(message)
- GetMemoryRequest(message)
- GetMemoryRevisionRequest(message)
- GetRagCorpusRequest(message)
- GetRagEngineConfigRequest(message)
- GetRagFileRequest(message)
- GetReasoningEngineRequest(message)
- GetSessionRequest(message)
- GetTuningJobRequest(message)
- GoAway(message)
- GoogleDriveSource(message)
- GoogleDriveSource.ResourceId(message)
- GoogleDriveSource.ResourceId.ResourceType(enum)
- GoogleMaps(message)
- GoogleSearchRetrieval(message)
- GroundednessInput(message)
- GroundednessInstance(message)
- GroundednessResult(message)
- GroundednessSpec(message)
- GroundingChunk(message)
- GroundingChunk.Maps(message)
- GroundingChunk.Maps.PlaceAnswerSources(message)
- GroundingChunk.Maps.PlaceAnswerSources.ReviewSnippet(message)
- GroundingChunk.RetrievedContext(message)
- GroundingChunk.Web(message)
- GroundingMetadata(message)
- GroundingMetadata.SourceFlaggingUri(message)
- GroundingSupport(message)
- HarmCategory(enum)
- HttpElementLocation(enum)
- ImageConfig(message)
- ImageConfig.ImageOutputOptions(message)
- ImageConfig.PersonGeneration(enum)
- ImportExtensionOperationMetadata(message)
- ImportExtensionRequest(message)
- ImportRagFilesConfig(message)
- ImportRagFilesOperationMetadata(message)
- ImportRagFilesRequest(message)
- ImportRagFilesResponse(message)
- IntermediateExtractedMemory(message)
- InvokeRequest(message)
- JiraSource(message)
- JiraSource.JiraQueries(message)
- JobState(enum)
- LLMBasedMetricSpec(message)
- ListCachedContentsRequest(message)
- ListCachedContentsResponse(message)
- ListEvaluationItemsRequest(message)
- ListEvaluationItemsResponse(message)
- ListEvaluationRunsRequest(message)
- ListEvaluationRunsResponse(message)
- ListEvaluationSetsRequest(message)
- ListEvaluationSetsResponse(message)
- ListEventsRequest(message)
- ListEventsResponse(message)
- ListExtensionsRequest(message)
- ListExtensionsResponse(message)
- ListMemoriesRequest(message)
- ListMemoriesResponse(message)
- ListMemoryRevisionsRequest(message)
- ListMemoryRevisionsResponse(message)
- ListRagCorporaRequest(message)
- ListRagCorporaResponse(message)
- ListRagFilesRequest(message)
- ListRagFilesResponse(message)
- ListReasoningEnginesRequest(message)
- ListReasoningEnginesResponse(message)
- ListSessionsRequest(message)
- ListSessionsResponse(message)
- ListTuningJobsRequest(message)
- ListTuningJobsResponse(message)
- LogprobsResult(message)
- LogprobsResult.Candidate(message)
- LogprobsResult.TopCandidates(message)
- Memory(message)
- MemoryBankCustomizationConfig(message)
- MemoryBankCustomizationConfig.GenerateMemoriesExample(message)
- MemoryBankCustomizationConfig.GenerateMemoriesExample.ConversationSource(message)
- MemoryBankCustomizationConfig.GenerateMemoriesExample.ConversationSource.Event(message)
- MemoryBankCustomizationConfig.GenerateMemoriesExample.GeneratedMemory(message)
- MemoryBankCustomizationConfig.MemoryTopic(message)
- MemoryBankCustomizationConfig.MemoryTopic.CustomMemoryTopic(message)
- MemoryBankCustomizationConfig.MemoryTopic.ManagedMemoryTopic(message)
- MemoryBankCustomizationConfig.MemoryTopic.ManagedMemoryTopic.ManagedTopicEnum(enum)
- MemoryRevision(message)
- MemoryTopicId(message)
- Metric(message)
- Metric.AggregationMetric(enum)
- MetricResult(message)
- MetricxInput(message)
- MetricxInstance(message)
- MetricxResult(message)
- MetricxSpec(message)
- MetricxSpec.MetricxVersion(enum)
- Modality(enum)
- ModalityTokenCount(message)
- ModelArmorConfig(message)
- MultiSpeakerVoiceConfig(message)
- OptimizePromptRequest(message)
- OptimizePromptRequest.OptimizationTarget(enum)
- OptimizePromptResponse(message)
- OutputConfig(message)
- OutputFieldSpec(message)
- OutputFieldSpec.FieldType(enum)
- OutputInfo(message)
- PairwiseChoice(enum)
- PairwiseMetricInput(message)
- PairwiseMetricInstance(message)
- PairwiseMetricResult(message)
- PairwiseMetricSpec(message)
- PairwiseQuestionAnsweringQualityInput(message)
- PairwiseQuestionAnsweringQualityInstance(message)
- PairwiseQuestionAnsweringQualityResult(message)
- PairwiseQuestionAnsweringQualitySpec(message)
- PairwiseSummarizationQualityInput(message)
- PairwiseSummarizationQualityInstance(message)
- PairwiseSummarizationQualityResult(message)
- PairwiseSummarizationQualitySpec(message)
- Part(message)
- PartnerModelTuningSpec(message)
- PointwiseMetricInput(message)
- PointwiseMetricInstance(message)
- PointwiseMetricResult(message)
- PointwiseMetricSpec(message)
- PreTunedModel(message)
- PrebuiltVoiceConfig(message)
- PredefinedMetricSpec(message)
- PredictLongRunningRequest(message)
- PredictRequest(message)
- PredictResponse(message)
- PreferenceOptimizationDataStats(message)
- PreferenceOptimizationHyperParameters(message)
- PreferenceOptimizationSpec(message)
- ProactivityConfig(message)
- PscInterfaceConfig(message)
- QueryExtensionRequest(message)
- QueryExtensionResponse(message)
- QueryReasoningEngineRequest(message)
- QueryReasoningEngineResponse(message)
- QuestionAnsweringCorrectnessInput(message)
- QuestionAnsweringCorrectnessInstance(message)
- QuestionAnsweringCorrectnessResult(message)
- QuestionAnsweringCorrectnessSpec(message)
- QuestionAnsweringHelpfulnessInput(message)
- QuestionAnsweringHelpfulnessInstance(message)
- QuestionAnsweringHelpfulnessResult(message)
- QuestionAnsweringHelpfulnessSpec(message)
- QuestionAnsweringQualityInput(message)
- QuestionAnsweringQualityInstance(message)
- QuestionAnsweringQualityResult(message)
- QuestionAnsweringQualitySpec(message)
- QuestionAnsweringRelevanceInput(message)
- QuestionAnsweringRelevanceInstance(message)
- QuestionAnsweringRelevanceResult(message)
- QuestionAnsweringRelevanceSpec(message)
- RagChunk(message)
- RagChunk.PageSpan(message)
- RagContexts(message)
- RagContexts.Context(message)
- RagCorpus(message)
- RagCorpus.CorpusTypeConfig(message)
- RagCorpus.CorpusTypeConfig.DocumentCorpus(message)
- RagCorpus.CorpusTypeConfig.MemoryCorpus(message)
- RagEmbeddingModelConfig(message)
- RagEmbeddingModelConfig.HybridSearchConfig(message)
- RagEmbeddingModelConfig.SparseEmbeddingConfig(message)
- RagEmbeddingModelConfig.SparseEmbeddingConfig.Bm25(message)
- RagEmbeddingModelConfig.VertexPredictionEndpoint(message)
- RagEngineConfig(message)
- RagFile(message)
- RagFile.RagFileType(enum)
- RagFileChunkingConfig(message)
- RagFileChunkingConfig.FixedLengthChunking(message)
- RagFileMetadataConfig(message)
- RagFileParsingConfig(message)
- RagFileParsingConfig.AdvancedParser(message)
- RagFileParsingConfig.LayoutParser(message)
- RagFileParsingConfig.LlmParser(message)
- RagFileTransformationConfig(message)
- RagManagedDbConfig(message)
- RagManagedDbConfig.Basic(message)
- RagManagedDbConfig.Enterprise(message) (deprecated)
- RagManagedDbConfig.Scaled(message)
- RagManagedDbConfig.Unprovisioned(message)
- RagQuery(message)
- RagQuery.Ranking(message)
- RagRetrievalConfig(message)
- RagRetrievalConfig.Filter(message)
- RagRetrievalConfig.HybridSearch(message)
- RagRetrievalConfig.Ranking(message)
- RagRetrievalConfig.Ranking.LlmRanker(message)
- RagRetrievalConfig.Ranking.RankService(message)
- RagVectorDbConfig(message)
- RagVectorDbConfig.Pinecone(message)
- RagVectorDbConfig.RagManagedDb(message)
- RagVectorDbConfig.RagManagedDb.ANN(message)
- RagVectorDbConfig.RagManagedDb.KNN(message)
- RagVectorDbConfig.VertexFeatureStore(message)
- RagVectorDbConfig.VertexVectorSearch(message)
- RagVectorDbConfig.Weaviate(message)
- RawOutput(message)
- RawPredictRequest(message)
- RealtimeInputConfig(message)
- RealtimeInputConfig.ActivityHandling(enum)
- RealtimeInputConfig.AutomaticActivityDetection(message)
- RealtimeInputConfig.AutomaticActivityDetection.EndSensitivity(enum)
- RealtimeInputConfig.AutomaticActivityDetection.StartSensitivity(enum)
- RealtimeInputConfig.TurnCoverage(enum)
- ReasoningEngine(message)
- ReasoningEngineContextSpec(message)
- ReasoningEngineContextSpec.MemoryBankConfig(message)
- ReasoningEngineContextSpec.MemoryBankConfig.GenerationConfig(message)
- ReasoningEngineContextSpec.MemoryBankConfig.SimilaritySearchConfig(message)
- ReasoningEngineContextSpec.MemoryBankConfig.TtlConfig(message)
- ReasoningEngineContextSpec.MemoryBankConfig.TtlConfig.GranularTtlConfig(message)
- ReasoningEngineSpec(message)
- ReasoningEngineSpec.DeploymentSpec(message)
- ReasoningEngineSpec.DeploymentSpec.AgentServerMode(enum)
- ReasoningEngineSpec.PackageSpec(message)
- ReasoningEngineSpec.SourceCodeSpec(message)
- ReasoningEngineSpec.SourceCodeSpec.InlineSource(message)
- ReasoningEngineSpec.SourceCodeSpec.PythonSpec(message)
- RebaseTunedModelOperationMetadata(message)
- RebaseTunedModelRequest(message)
- Retrieval(message)
- RetrievalConfig(message)
- RetrievalMetadata(message)
- RetrieveContextsRequest(message)
- RetrieveContextsRequest.VertexRagStore(message)
- RetrieveContextsRequest.VertexRagStore.RagResource(message)
- RetrieveContextsResponse(message)
- RetrieveMemoriesRequest(message)
- RetrieveMemoriesRequest.SimilaritySearchParams(message)
- RetrieveMemoriesRequest.SimpleRetrievalParams(message)
- RetrieveMemoriesResponse(message)
- RetrieveMemoriesResponse.RetrievedMemory(message)
- RollbackMemoryOperationMetadata(message)
- RollbackMemoryRequest(message)
- RollbackMemoryResponse(message)
- RougeInput(message)
- RougeInstance(message)
- RougeMetricValue(message)
- RougeResults(message)
- RougeSpec(message)
- Rubric(message)
- Rubric.Content(message)
- Rubric.Content.Property(message)
- Rubric.Importance(enum)
- RubricBasedInstructionFollowingInput(message)
- RubricBasedInstructionFollowingInstance(message)
- RubricBasedInstructionFollowingResult(message)
- RubricBasedInstructionFollowingSpec(message)
- RubricCritiqueResult(message)
- RubricGenerationSpec(message)
- RubricGenerationSpec.RubricContentType(enum)
- RubricGroup(message)
- RubricVerdict(message)
- RuntimeConfig(message)
- RuntimeConfig.CodeInterpreterRuntimeConfig(message)
- RuntimeConfig.VertexAISearchRuntimeConfig(message)
- SafetyInput(message)
- SafetyInstance(message)
- SafetyRating(message)
- SafetyRating.HarmProbability(enum)
- SafetyRating.HarmSeverity(enum)
- SafetyResult(message)
- SafetySetting(message)
- SafetySetting.HarmBlockMethod(enum)
- SafetySetting.HarmBlockThreshold(enum)
- SafetySpec(message)
- Schema(message)
- SearchEntryPoint(message)
- SecretEnvVar(message)
- SecretRef(message)
- Segment(message)
- Session(message)
- SessionEvent(message)
- SessionResumptionConfig(message)
- SessionResumptionUpdate(message)
- SharePointSources(message)
- SharePointSources.SharePointSource(message)
- SlackSource(message)
- SlackSource.SlackChannels(message)
- SlackSource.SlackChannels.SlackChannel(message)
- SpeakerVoiceConfig(message)
- SpeechConfig(message)
- StreamDirectPredictRequest(message)
- StreamDirectPredictResponse(message)
- StreamDirectRawPredictRequest(message)
- StreamDirectRawPredictResponse(message)
- StreamQueryReasoningEngineRequest(message)
- StreamRawPredictRequest(message)
- StreamingPredictRequest(message)
- StreamingPredictResponse(message)
- StreamingRawPredictRequest(message)
- StreamingRawPredictResponse(message)
- SummarizationHelpfulnessInput(message)
- SummarizationHelpfulnessInstance(message)
- SummarizationHelpfulnessResult(message)
- SummarizationHelpfulnessSpec(message)
- SummarizationQualityInput(message)
- SummarizationQualityInstance(message)
- SummarizationQualityResult(message)
- SummarizationQualitySpec(message)
- SummarizationVerbosityInput(message)
- SummarizationVerbosityInstance(message)
- SummarizationVerbosityResult(message)
- SummarizationVerbositySpec(message)
- SummaryMetrics(message)
- SupervisedHyperParameters(message)
- SupervisedHyperParameters.AdapterSize(enum)
- SupervisedTuningDataStats(message)
- SupervisedTuningDatasetDistribution(message)
- SupervisedTuningDatasetDistribution.DatasetBucket(message)
- SupervisedTuningSpec(message)
- SupervisedTuningSpec.TuningMode(enum)
- SyntheticExample(message)
- SyntheticField(message)
- TaskDescriptionStrategy(message)
- Tensor(message)
- Tensor.DataType(enum)
- TokensInfo(message)
- Tool(message)
- Tool.CodeExecution(message)
- Tool.ComputerUse(message)
- Tool.ComputerUse.Environment(enum)
- Tool.GoogleSearch(message)
- Tool.PhishBlockThreshold(enum)
- ToolCall(message)
- ToolCallValidInput(message)
- ToolCallValidInstance(message)
- ToolCallValidMetricValue(message)
- ToolCallValidResults(message)
- ToolCallValidSpec(message)
- ToolConfig(message)
- ToolNameMatchInput(message)
- ToolNameMatchInstance(message)
- ToolNameMatchMetricValue(message)
- ToolNameMatchResults(message)
- ToolNameMatchSpec(message)
- ToolParameterKVMatchInput(message)
- ToolParameterKVMatchInstance(message)
- ToolParameterKVMatchMetricValue(message)
- ToolParameterKVMatchResults(message)
- ToolParameterKVMatchSpec(message)
- ToolParameterKeyMatchInput(message)
- ToolParameterKeyMatchInstance(message)
- ToolParameterKeyMatchMetricValue(message)
- ToolParameterKeyMatchResults(message)
- ToolParameterKeyMatchSpec(message)
- ToolUseExample(message)
- ToolUseExample.ExtensionOperation(message)
- Trajectory(message)
- TrajectoryAnyOrderMatchInput(message)
- TrajectoryAnyOrderMatchInstance(message)
- TrajectoryAnyOrderMatchMetricValue(message)
- TrajectoryAnyOrderMatchResults(message)
- TrajectoryAnyOrderMatchSpec(message)
- TrajectoryExactMatchInput(message)
- TrajectoryExactMatchInstance(message)
- TrajectoryExactMatchMetricValue(message)
- TrajectoryExactMatchResults(message)
- TrajectoryExactMatchSpec(message)
- TrajectoryInOrderMatchInput(message)
- TrajectoryInOrderMatchInstance(message)
- TrajectoryInOrderMatchMetricValue(message)
- TrajectoryInOrderMatchResults(message)
- TrajectoryInOrderMatchSpec(message)
- TrajectoryPrecisionInput(message)
- TrajectoryPrecisionInstance(message)
- TrajectoryPrecisionMetricValue(message)
- TrajectoryPrecisionResults(message)
- TrajectoryPrecisionSpec(message)
- TrajectoryRecallInput(message)
- TrajectoryRecallInstance(message)
- TrajectoryRecallMetricValue(message)
- TrajectoryRecallResults(message)
- TrajectoryRecallSpec(message)
- TrajectorySingleToolUseInput(message)
- TrajectorySingleToolUseInstance(message)
- TrajectorySingleToolUseMetricValue(message)
- TrajectorySingleToolUseResults(message)
- TrajectorySingleToolUseSpec(message)
- TunedModel(message)
- TunedModelCheckpoint(message)
- TunedModelRef(message)
- TuningDataStats(message)
- TuningJob(message)
- TuningJob.TuningJobState(enum)
- Type(enum)
- UpdateCacheConfigRequest(message)
- UpdateCachedContentRequest(message)
- UpdateEvaluationSetRequest(message)
- UpdateExtensionRequest(message)
- UpdateMemoryOperationMetadata(message)
- UpdateMemoryRequest(message)
- UpdateRagCorpusOperationMetadata(message)
- UpdateRagCorpusRequest(message)
- UpdateRagEngineConfigOperationMetadata(message)
- UpdateRagEngineConfigRequest(message)
- UpdateReasoningEngineOperationMetadata(message)
- UpdateReasoningEngineRequest(message)
- UpdateSessionRequest(message)
- UploadRagFileConfig(message)
- UrlContext(message)
- UrlContextMetadata(message)
- UrlMetadata(message)
- UrlMetadata.UrlRetrievalStatus(enum)
- UsageMetadata(message)
- UsageMetadata.TrafficType(enum)
- VeoHyperParameters(message)
- VeoHyperParameters.TuningTask(enum)
- VeoTuningSpec(message)
- VertexAISearch(message)
- VertexAISearch.DataStoreSpec(message)
- VertexAiSearchConfig(message)
- VertexRagStore(message)
- VertexRagStore.RagResource(message)
- VideoMetadata(message)
- VoiceConfig(message)
DataFoundryService
Service for generating and preparing datasets for Gen AI evaluation.
| Method | Description |
|---|---|
| GenerateSyntheticData | Generates synthetic data based on the provided configuration. |
EvaluationManagementService
Vertex AI Evaluation Management Service.
| Method | Description |
|---|---|
| CancelEvaluationRun | Cancels an Evaluation Run. Attempts to cancel a running Evaluation Run asynchronously. The status of the run can be checked via GetEvaluationRun. |
| CreateEvaluationItem | Creates an Evaluation Item. |
| CreateEvaluationRun | Creates an Evaluation Run. |
| CreateEvaluationSet | Creates an Evaluation Set. |
| DeleteEvaluationItem | Deletes an Evaluation Item. |
| DeleteEvaluationRun | Deletes an Evaluation Run. |
| DeleteEvaluationSet | Deletes an Evaluation Set. |
| GetEvaluationItem | Gets an Evaluation Item. |
| GetEvaluationRun | Gets an Evaluation Run. |
| GetEvaluationSet | Gets an Evaluation Set. |
| ListEvaluationItems | Lists Evaluation Items. |
| ListEvaluationRuns | Lists Evaluation Runs. |
| ListEvaluationSets | Lists Evaluation Sets. |
| UpdateEvaluationSet | Updates an Evaluation Set. |
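Because CancelEvaluationRun only attempts cancellation asynchronously, a caller would typically poll GetEvaluationRun until the run reaches a terminal state. The following is a minimal sketch of such a loop, not the client library's API: `get_state` is a hypothetical callable standing in for a GetEvaluationRun call, and the state names passed to it are illustrative placeholders, not values confirmed from the EvaluationRun.State enum.

```python
import time
from typing import Callable, Iterable


def wait_for_terminal_state(
    get_state: Callable[[], str],
    terminal_states: Iterable[str],
    poll_interval_s: float = 1.0,
    max_polls: int = 30,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll get_state() until it returns a terminal state or polls run out.

    get_state is a hypothetical stand-in for a GetEvaluationRun call that
    returns the run's current state as a string.
    """
    terminal = set(terminal_states)
    for attempt in range(max_polls):
        state = get_state()
        if state in terminal:
            return state
        # Wait between polls, but not after the final attempt.
        if attempt < max_polls - 1:
            sleep(poll_interval_s)
    raise TimeoutError(f"no terminal state after {max_polls} polls")
```

Bounding the number of polls keeps the caller from waiting forever if the cancellation never takes effect; the `sleep` parameter is injectable so the loop can be exercised without real delays.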
EvaluationService
Vertex AI Online Evaluation Service.
| Method | Description |
|---|---|
| EvaluateDataset | Evaluates a dataset based on a set of given metrics. |
| EvaluateInstances | Evaluates instances based on a given metric. |
| GenerateInstanceRubrics | Generates rubrics for a given prompt. A rubric represents a single testable criterion for evaluation. One input prompt can have multiple rubrics. This RPC lets users get suggested rubrics based on the provided prompt, which can then be reviewed and used for subsequent evaluations. |
ExtensionExecutionService
A service for Extension execution.
| Method | Description |
|---|---|
| ExecuteExtension | Executes the request against a given extension. |
| QueryExtension | Queries an extension with a default controller. |
ExtensionRegistryService
A service for managing Vertex AI's Extension registry.
| Method | Description |
|---|---|
| DeleteExtension | Deletes an Extension. |
| GetExtension | Gets an Extension. |
| ImportExtension | Imports an Extension. |
| ListExtensions | Lists Extensions in a location. |
| UpdateExtension | Updates an Extension. |
GenAiCacheConfigService
Service for GenAI Cache Config.
| Method | Description |
|---|---|
| GetCacheConfig | Gets a GenAI cache config. |
| UpdateCacheConfig | Updates a cache config. |
GenAiCacheService
Service for managing Vertex AI's CachedContent resource.
| Method | Description |
|---|---|
| CreateCachedContent | Creates cached content. This call initializes the cached content in the data storage, and users need to pay for the cache data storage. |
| DeleteCachedContent | Deletes cached content. |
| GetCachedContent | Gets cached content configurations. |
| ListCachedContents | Lists cached contents in a project. |
| UpdateCachedContent | Updates cached content configurations. |
GenAiTuningService
A service for creating and managing GenAI Tuning Jobs.
| Method | Description |
|---|---|
| CancelTuningJob | Cancels a TuningJob. Starts asynchronous cancellation on the TuningJob. The server makes a best effort to cancel the job, but success is not guaranteed. Clients can use |
| CreateTuningJob | Creates a TuningJob. The service attempts to run a TuningJob as soon as it is created. |
| GetTuningJob | Gets a TuningJob. |
| ListTuningJobs | Lists TuningJobs in a Location. |
| OptimizePrompt | Optimizes a prompt. |
| RebaseTunedModel | Rebases a TunedModel. |
LlmBidiService
A service for LLM related bidirectional low latency APIs.
| Method | Description |
|---|---|
| BidiGenerateContent | Bidirectional streaming predict. |
LlmUtilityService
Service for LLM related utility functions.
| Method | Description |
|---|---|
| ComputeTokens | Returns a list of tokens based on the input text. |
MemoryBankService
A service for managing memories for LLM applications.
| Method | Description |
|---|---|
| CreateMemory | Create a Memory. |
| DeleteMemory | Delete a Memory. |
| GenerateMemories | Generate memories. |
| GetMemory | Get a Memory. |
| GetMemoryRevision | Get a Memory Revision. |
| ListMemories | List Memories. |
| ListMemoryRevisions | List Memory Revisions for a Memory. |
| RetrieveMemories | Retrieve memories. |
| RollbackMemory | Rollback Memory to a specific revision. |
| UpdateMemory | Update a Memory. |
PredictionService
A service for online predictions and explanations.
| Method | Description |
|---|---|
| ChatCompletions | Exposes an OpenAI-compatible endpoint for chat completions. |
| CountTokens | Performs token counting. |
| EmbedContent | Embed content with multimodal inputs. |
| FetchPredictOperation | Fetch an asynchronous online prediction operation. |
| GenerateContent | Generate content with multimodal inputs. |
| Invoke | Forwards arbitrary HTTP requests for both streaming and non-streaming cases. To use this method, [invoke_route_prefix][Model.container_spec.invoke_route_prefix] must be set to allow the paths that will be specified in the request. |
| Predict | Request message for running inference on Google's generative AI models on Vertex AI. You can use this method to perform tasks like image generation, image editing, virtual try-on, visual question answering, video generation, and generating text and multimodal embeddings. To run inference on a base (non-tuned) Gemini model, see |
| PredictLongRunning | |
| RawPredict | Perform an online prediction with an arbitrary HTTP payload. The response includes the following HTTP headers: |
| ServerStreamingPredict | Perform a server-side streaming online prediction request for Vertex LLM streaming. |
| StreamDirectPredict | Perform a streaming online prediction request to a gRPC model server for Vertex first-party products and frameworks. |
| StreamDirectRawPredict | Perform a streaming online prediction request to a gRPC model server for custom containers. |
| StreamGenerateContent | Generate content with multimodal inputs with streaming support. |
| StreamRawPredict | Perform a streaming online prediction with an arbitrary HTTP payload. |
| StreamingPredict | Perform a streaming online prediction request for Vertex first-party products and frameworks. |
| StreamingRawPredict | Perform a streaming online prediction request through gRPC. |
ReasoningEngineExecutionService
A service for executing queries on Reasoning Engine.
| A2aGetReasoningEngine | 
|---|
| 
 Sends a GET request to a reasoning engine instance via the A2A get protocol APIs. 
 | 
| A2aPostReasoningEngine | 
|---|
| 
 Sends a POST request to a reasoning engine instance via the A2A post protocol APIs. 
 | 
| A2aStreamGetReasoningEngine | 
|---|
| 
 Streams a GET request to a reasoning engine instance via the A2A stream get protocol APIs. 
 | 
| A2aStreamPostReasoningEngine | 
|---|
| 
 Streams queries using a reasoning engine instance via the A2A streaming protocol APIs. 
 | 
| BidiQueryReasoningEngine | 
|---|
| 
 Streams queries bidirectionally using a reasoning engine. 
 | 
| QueryReasoningEngine | 
|---|
| 
 Queries using a reasoning engine. 
 | 
| StreamQueryReasoningEngine | 
|---|
| 
 Streams queries using a reasoning engine. 
 | 
ReasoningEngineService
A service for managing Vertex AI's Reasoning Engines.
| CreateReasoningEngine | 
|---|
| 
 Creates a reasoning engine. 
 | 
| DeleteReasoningEngine | 
|---|
| 
 Deletes a reasoning engine. 
 | 
| GetReasoningEngine | 
|---|
| 
 Gets a reasoning engine. 
 | 
| ListReasoningEngines | 
|---|
| 
 Lists reasoning engines in a location. 
 | 
| UpdateReasoningEngine | 
|---|
| 
 Updates a reasoning engine. 
 | 
SessionService
The service that manages Vertex Session related resources.
| AppendEvent | 
|---|
| 
 Appends an event to a given session. 
 | 
| CreateSession | 
|---|
| 
 Creates a new  
 | 
| DeleteSession | 
|---|
| 
 Deletes details of the specific  
 | 
| GetSession | 
|---|
| 
 Gets details of the specific  
 | 
| ListEvents | 
|---|
| 
 Lists  
 | 
| ListSessions | 
|---|
| 
 Lists  
 | 
| UpdateSession | 
|---|
| 
 Updates the specific  
 | 
VertexRagDataService
A service for managing user data for RAG.
| CreateRagCorpus | 
|---|
| 
 Creates a RagCorpus. 
 | 
| DeleteRagCorpus | 
|---|
| 
 Deletes a RagCorpus. 
 | 
| DeleteRagFile | 
|---|
| 
 Deletes a RagFile. 
 | 
| GetRagCorpus | 
|---|
| 
 Gets a RagCorpus. 
 | 
| GetRagEngineConfig | 
|---|
| 
 Gets a RagEngineConfig. 
 | 
| GetRagFile | 
|---|
| 
 Gets a RagFile. 
 | 
| ImportRagFiles | 
|---|
| 
 Import files from Google Cloud Storage or Google Drive into a RagCorpus. 
 | 
| ListRagCorpora | 
|---|
| 
 Lists RagCorpora in a Location. 
 | 
| ListRagFiles | 
|---|
| 
 Lists RagFiles in a RagCorpus. 
 | 
| UpdateRagCorpus | 
|---|
| 
 Updates a RagCorpus. 
 | 
| UpdateRagEngineConfig | 
|---|
| 
 Updates a RagEngineConfig. 
 | 
VertexRagService
A service for retrieving relevant contexts.
| AugmentPrompt | 
|---|
| 
 Given an input prompt, returns an augmented prompt from the Vertex RAG store to guide the LLM towards generating grounded responses. 
 | 
| CorroborateContent | 
|---|
| 
 Given an input text, it returns a score that evaluates the factuality of the text. It also extracts and returns claims from the text and provides supporting facts. 
 | 
| RetrieveContexts | 
|---|
| 
 Retrieves relevant contexts for a query. 
 | 
A2aGetReasoningEngineRequest
Request message for ReasoningEngineExecutionService.A2aGetReasoningEngine.
| Fields | |
|---|---|
| name | 
 Required. The full resource path of the reasoning engine, captured from the URL. Format:  | 
| a2a_endpoint | 
 Required. The http endpoint extracted from the URL path. i.e.  | 
| history_length | 
 Optional. The optional query parameter for the getTask endpoint. Mapped from "?history_length= | 
A2aPostReasoningEngineRequest
Request message for ReasoningEngineExecutionService.A2aPostReasoningEngine.
| Fields | |
|---|---|
| name | 
 Required. The full resource path of the reasoning engine, captured from the URL. | 
| a2a_endpoint | 
 Required. The a2a endpoint path, captured from the URL. e.g., v1/message:send | 
| request | Required. The A2A json payload, forwarded to the backend. | 
A2aStreamGetReasoningEngineRequest
Request message for ReasoningEngineExecutionService.A2aStreamGetReasoningEngine.
| Fields | |
|---|---|
| name | 
 Required. The full resource path of the reasoning engine, captured from the URL. | 
| a2a_endpoint | 
 Required. The http endpoint extracted from the URL path. i.e.  | 
A2aStreamPostReasoningEngineRequest
Request message for ReasoningEngineExecutionService.A2aStreamPostReasoningEngine.
| Fields | |
|---|---|
| name | 
 Required. The full resource path of the reasoning engine, captured from the URL. | 
| a2a_endpoint | 
 Required. The http endpoint extracted from the URL path. e.g., v1/message:stream. | 
| request | Required. The A2A json payload, forwarded to the backend. | 
AggregationOutput
The aggregation result for the entire dataset and all metrics.
| Fields | |
|---|---|
| dataset | The dataset used for evaluation & aggregation. | 
| aggregation_results[] | One AggregationResult per metric. | 
AggregationResult
The aggregation result for a single metric.
| Fields | |
|---|---|
| aggregation_metric | Aggregation metric. | 
| Union field aggregation_result. The aggregation result. aggregation_result can be only one of the following: | |
| pointwise_metric_result | Result for pointwise metric. | 
| pairwise_metric_result | Result for pairwise metric. | 
| exact_match_metric_value | Results for exact match metric. | 
| bleu_metric_value | Results for bleu metric. | 
| rouge_metric_value | Results for rouge metric. | 
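The "can be only one of the following" rule for the aggregation_result union can be checked on the client before a result is consumed. The sketch below assumes the result has been decoded into a plain Python dict keyed by the field names in the table; the helper name `which_aggregation_result` is hypothetical and is not part of any client library.

```python
from typing import Optional

# Members of the aggregation_result union, as listed in the table above.
AGGREGATION_RESULT_FIELDS = (
    "pointwise_metric_result",
    "pairwise_metric_result",
    "exact_match_metric_value",
    "bleu_metric_value",
    "rouge_metric_value",
)


def which_aggregation_result(result: dict) -> Optional[str]:
    """Return the name of the single union member that is set, or None.

    Hypothetical helper: raises if more than one member is present,
    because a union field can hold only one of its members.
    """
    present = [f for f in AGGREGATION_RESULT_FIELDS if f in result]
    if len(present) > 1:
        raise ValueError(f"aggregation_result has several members set: {present}")
    return present[0] if present else None
```

Dispatching on the returned name keeps per-metric handling in one place, for example `handlers[which_aggregation_result(r)](r)`.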
ApiAuth
The generic reusable api auth config. Deprecated. Please use AuthConfig (google/cloud/aiplatform/master/auth.proto) instead.
| Fields | |
|---|---|
| Union field auth_config. The auth config. auth_config can be only one of the following: | |
| api_key_config | The API secret. | 
ApiKeyConfig
The API secret.
| Fields | |
|---|---|
| api_key_secret_version | 
 Required. The SecretManager secret version resource name storing API key. e.g. projects/{project}/secrets/{secret}/versions/{version} | 
| api_key_string | 
 The API key string. Either this or  | 
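The api_key_secret_version value follows the resource-name pattern shown above. A minimal sketch of validating that pattern before sending a request is given here; the function name `parse_secret_version` is an assumption, and the check is purely syntactic (it does not confirm that the secret exists).

```python
import re

# Pattern taken from the example in the field description:
# projects/{project}/secrets/{secret}/versions/{version}
_SECRET_VERSION_RE = re.compile(
    r"^projects/(?P<project>[^/]+)/secrets/(?P<secret>[^/]+)/versions/(?P<version>[^/]+)$"
)


def parse_secret_version(name: str) -> dict:
    """Split a secret version resource name into its three components."""
    match = _SECRET_VERSION_RE.match(name)
    if match is None:
        raise ValueError(f"not a secret version resource name: {name!r}")
    return match.groupdict()
```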
AppendEventRequest
Request message for SessionService.AppendEvent.
| Fields | |
|---|---|
| name | 
 Required. The resource name of the session to append event to. Format:  | 
| event | Required. The event to append to the session. | 
AppendEventResponse
This type has no fields.
Response message for SessionService.AppendEvent.
AugmentPromptRequest
Request message for AugmentPrompt.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location from which to augment prompt. The users must have permission to make a call in the project. Format:  | 
| contents[] | Optional. Input content to augment, only text format is supported for now. | 
| model | Optional. Metadata of the backend deployed model. | 
| Union field data_source. The data source for retrieving contexts. data_source can be only one of the following: | |
| vertex_rag_store | Optional. Retrieves contexts from the Vertex RagStore. | 
Model
Metadata of the backend deployed model.
| Fields | |
|---|---|
| model | 
 Optional. The model that the user will send the augmented prompt for content generation. | 
| model_version | 
 Optional. The model version of the backend deployed model. | 
AugmentPromptResponse
Response message for AugmentPrompt.
| Fields | |
|---|---|
| augmented_prompt[] | Augmented prompt, only text format is supported for now. | 
| facts[] | Retrieved facts from RAG data sources. | 
AuthConfig
Auth configuration to run the extension.
| Fields | |
|---|---|
| auth_type | Type of auth scheme. | 
| Union field  
 | |
| api_key_config | Config for API key auth. | 
| http_basic_auth_config | Config for HTTP Basic auth. | 
| google_service_account_config | Config for Google Service Account auth. | 
| oauth_config | Config for user oauth. | 
| oidc_config | Config for user OIDC auth. | 
ApiKeyConfig
Config for authentication with API key.
| Fields | |
|---|---|
| name | 
 Optional. The parameter name of the API key. E.g. If the API request is "https://example.com/act?api_key= | 
| api_key_secret | 
 Optional. The name of the SecretManager secret version resource storing the API key. Format:  
 | 
| api_key_string | 
 Optional. The API key to be used in the request directly. | 
| http_element_location | Optional. The location of the API key. | 
GoogleServiceAccountConfig
Config for Google Service Account Authentication.
| Fields | |
|---|---|
| service_account | 
 Optional. The service account that the extension execution service runs as. 
 | 
HttpBasicAuthConfig
Config for HTTP Basic Authentication.
| Fields | |
|---|---|
| credential_secret | 
 Required. The name of the SecretManager secret version resource storing the base64 encoded credentials. Format:  
 | 
OauthConfig
Config for user oauth.
| Fields | |
|---|---|
| Union field  
 | |
| access_token | 
 Access token for extension endpoint. Only used to propagate token from [[ExecuteExtensionRequest.runtime_auth_config]] at request time. | 
| service_account | 
 The service account used to generate access tokens for executing the Extension. 
 | 
OidcConfig
Config for user OIDC auth.
| Fields | |
|---|---|
| Union field  
 | |
| id_token | 
 OpenID Connect formatted ID token for extension endpoint. Only used to propagate token from [[ExecuteExtensionRequest.runtime_auth_config]] at request time. | 
| service_account | 
 The service account used to generate an OpenID Connect (OIDC)-compatible JWT token signed by the Google OIDC Provider (accounts.google.com) for extension endpoint (https://cloud.google.com/iam/docs/create-short-lived-credentials-direct#sa-credentials-oidc). 
 | 
AuthType
Type of Auth.
| Enums | |
|---|---|
| AUTH_TYPE_UNSPECIFIED | |
| NO_AUTH | No Auth. | 
| API_KEY_AUTH | API Key Auth. | 
| HTTP_BASIC_AUTH | HTTP Basic Auth. | 
| GOOGLE_SERVICE_ACCOUNT_AUTH | Google Service Account Auth. | 
| OAUTH | OAuth auth. | 
| OIDC_AUTH | OpenID Connect (OIDC) Auth. | 
AutoraterConfig
The configs for autorater. This is applicable to both EvaluateInstances and EvaluateDataset.
| Fields | |
|---|---|
| autorater_model | 
 Optional. The fully qualified name of the publisher model or tuned autorater endpoint to use. Publisher model format:  Tuned model endpoint format:  | 
| generation_config | Optional. Configuration options for model generation and outputs. | 
| sampling_count | 
 Optional. Number of samples for each instance in the dataset. If not specified, the default is 4. Minimum value is 1, maximum value is 32. | 
| flip_enabled | 
 Optional. Default is true. Whether to flip the candidate and baseline responses. This is only applicable to the pairwise metric. If enabled, also provide PairwiseMetricSpec.candidate_response_field_name and PairwiseMetricSpec.baseline_response_field_name. When rendering PairwiseMetricSpec.metric_prompt_template, the candidate and baseline fields will be flipped for half of the samples to reduce bias. | 
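The sampling_count bounds and the "flipped for half of the samples" behavior can be illustrated with a short sketch. Both helper names are hypothetical, and the choice of which half gets flipped (odd-numbered samples here) is an assumption made only for illustration; the service decides this internally.

```python
from typing import List, Optional

DEFAULT_SAMPLING_COUNT = 4   # default stated in the field description
MIN_SAMPLING_COUNT = 1
MAX_SAMPLING_COUNT = 32


def resolve_sampling_count(value: Optional[int] = None) -> int:
    """Apply the documented default and range check for sampling_count."""
    if value is None:
        return DEFAULT_SAMPLING_COUNT
    if not MIN_SAMPLING_COUNT <= value <= MAX_SAMPLING_COUNT:
        raise ValueError("sampling_count must be between 1 and 32")
    return value


def flip_plan(sampling_count: int, flip_enabled: bool = True) -> List[bool]:
    """Return one flag per sample; True means candidate and baseline are swapped.

    Assumption: odd-indexed samples are the flipped half.
    """
    return [flip_enabled and i % 2 == 1 for i in range(sampling_count)]
```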
BidiGenerateContentClientContent
Incremental update of the current conversation delivered from the client. All the content here is unconditionally appended to the conversation history and used as part of the prompt to the model to generate content.
A message here will interrupt any current model generation.
| Fields | |
|---|---|
| turns[] | Optional. The content appended to the current conversation with the model. For single-turn queries, this is a single instance. For multi-turn queries, this is a repeated field that contains conversation history and latest request. | 
| turn_complete | 
 Optional. If true, indicates that the server content generation should start with the currently accumulated prompt. Otherwise, the server will await additional messages before starting generation. | 
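The interaction between turns[] and turn_complete can be modeled with a small buffer: every message is appended to the history, and generation starts only once a message carries turn_complete. This is an illustrative model of the described semantics, not the server implementation; the class name and the dict shape of each turn are assumptions.

```python
class ConversationBuffer:
    """Toy model of how client content accumulates into the prompt."""

    def __init__(self) -> None:
        self.history = []

    def apply(self, client_content: dict) -> bool:
        """Append the turns and report whether generation should start."""
        # Content is appended unconditionally, as the description states.
        self.history.extend(client_content.get("turns", []))
        # Without turn_complete the server keeps waiting for more messages.
        return bool(client_content.get("turn_complete", False))
```

For example, a client could send context in several messages and set turn_complete only on the last one.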
BidiGenerateContentClientMessage
Messages sent by the client in the BidiGenerateContent RPC call.
| Fields | |
|---|---|
| Union field message_type. The type of the message. message_type can be only one of the following: | |
| setup | Optional. Message to be sent in the first and only first client message. | 
| client_content | 
 Optional. Incremental update of the current conversation delivered from the client. | 
| realtime_input | 
 Optional. User input that is sent in real time. | 
| tool_response | 
 Optional. Response to a  | 
BidiGenerateContentRealtimeInput
User input that is sent in real time.
This is different from ClientContentUpdate in a few ways:
- Can be sent continuously without interruption to model generation.
- If there is a need to mix data interleaved across the ClientContentUpdate and the RealtimeUpdate, server attempts to optimize for best response, but there are no guarantees.
- End of turn is not explicitly specified, but is rather derived from user activity (for example, end of speech).
- Even before the end of turn, the data is processed incrementally to optimize for a fast start of the response from the model.
- Is always assumed to be the user's input (cannot be used to populate conversation history).
| Fields | |
|---|---|
| media_chunks[] | Optional. Inlined bytes data for media input. | 
| audio | Optional. These form the realtime audio input stream. | 
| video | Optional. These form the realtime video input stream. | 
| activity_start | Optional. Marks the start of user activity. This can only be sent if automatic (i.e. server-side) activity detection is disabled. | 
| activity_end | Optional. Marks the end of user activity. This can only be sent if automatic (i.e. server-side) activity detection is disabled. | 
| audio_stream_end | 
 Optional. Indicates that the audio stream has ended, e.g. because the microphone was turned off. This should only be sent when automatic activity detection is enabled (which is the default). The client can reopen the stream by sending an audio message. | 
| text | 
 Optional. These form the realtime text input stream. | 
ActivityEnd
This type has no fields.
Marks the end of user activity.
ActivityStart
This type has no fields.
Only one of the fields in this message must be set at a time. Marks the start of user activity.
BidiGenerateContentServerContent
Incremental server update generated by the model in response to client messages.
Content is generated as quickly as possible, and not in realtime. Clients may choose to buffer and play it out in realtime.
| Fields | |
|---|---|
| turn_complete | 
 Output only. If true, indicates that the model is done generating. Generation will only start in response to additional client messages. Can be set alongside  | 
| interrupted | 
 Output only. If true, indicates that a client message has interrupted current model generation. If the client is playing out the content in realtime, this is a good signal to stop and empty the current playback queue. | 
| generation_complete | 
 Output only. If true, indicates that the model is done generating. When the model is interrupted while generating, there will be no 'generation_complete' message in the interrupted turn; it will go through 'interrupted > turn_complete'. When the model assumes realtime playback, there will be a delay between generation_complete and turn_complete, caused by the model waiting for playback to finish. | 
| grounding_metadata | Output only. Metadata specifies sources used to ground generated content. | 
| input_transcription | Optional. Input transcription. The transcription is independent of the model turn, which means it doesn't imply any ordering between transcription and model turn. | 
| output_transcription | Optional. Output transcription. The transcription is independent of the model turn, which means it doesn't imply any ordering between transcription and model turn. | 
| turn_complete_reason | Output only. The reason why the turn is complete. | 
| model_turn | Output only. The content that the model has generated as part of the current conversation with the user. | 
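A client that buffers content for realtime playback can react to the interrupted and turn_complete flags roughly as follows. This is a sketch under stated assumptions: server content arrives as a plain dict, the class name `PlaybackQueue` is hypothetical, and dropping any model_turn that arrives in the same message as interrupted is a design choice of this example, not documented behavior.

```python
from collections import deque


class PlaybackQueue:
    """Buffers model output and empties itself when the model is interrupted."""

    def __init__(self) -> None:
        self.chunks = deque()
        self.turn_done = False

    def on_server_content(self, server_content: dict) -> None:
        if server_content.get("interrupted"):
            # Stop playback and drop everything that has not been played yet.
            self.chunks.clear()
        elif server_content.get("model_turn") is not None:
            self.chunks.append(server_content["model_turn"])
        if server_content.get("turn_complete"):
            self.turn_done = True
```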
Transcription
Audio transcription message.
| Fields | |
|---|---|
| text | 
 Optional. Transcription text. | 
| finished | 
 Optional. Indicates the end of the transcription. | 
TurnCompleteReason
The reason why the turn is complete.
| Enums | |
|---|---|
| TURN_COMPLETE_REASON_UNSPECIFIED | Reason is unspecified. | 
| MALFORMED_FUNCTION_CALL | The function call generated by the model is invalid. | 
| RESPONSE_REJECTED | The response is rejected by the model. | 
| NEED_MORE_INPUT | Needs more input from the user. | 
BidiGenerateContentServerMessage
Response message for BidiGenerateContent RPC call.
| Fields | |
|---|---|
| usage_metadata | Output only. Usage metadata about the response(s). | 
| Union field message_type. The type of the message. message_type can be only one of the following: | |
| setup_complete | 
 Output only. Sent in response to a  | 
| server_content | 
 Output only. Content generated by the model in response to client messages. | 
| tool_call | Output only. Request for the client to execute the  | 
| tool_call_cancellation | 
 Output only. Notification for the client that a previously issued  | 
| go_away | Output only. Server will disconnect soon. | 
| session_resumption_update | Output only. Update of the session resumption state. | 
BidiGenerateContentSetup
Message to be sent in the first and only first BidiGenerateContentClientMessage. Contains configuration that will apply for the duration of the streaming RPC.
Clients should wait for a BidiGenerateContentSetupComplete message before sending any additional messages.
| Fields | |
|---|---|
| model | 
 Required. The fully qualified name of the publisher model. Publisher model format:  | 
| generation_config | Optional. Generation config. The following fields aren't supported: 
 | 
| system_instruction | Optional. The user provided system instructions for the model. Note: only text should be used in parts and content in each part will be in a separate paragraph. | 
| tools[] | Optional. A list of  A  | 
| session_resumption | Optional. Configures session resumption mechanism. If included, the server will send periodical  | 
| context_window_compression | 
 Optional. Configures context window compression mechanism. If included, server will compress context window to fit into given length. | 
| realtime_input_config | Optional. Configures the handling of realtime input. | 
| input_audio_transcription | Optional. The transcription of the input aligns with the input audio language. | 
| output_audio_transcription | Optional. The transcription of the output aligns with the language code specified for the output audio. | 
| proactivity | Optional. Configures the proactivity of the model. This allows the model to respond proactively to the input and to ignore irrelevant input. | 
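The ordering rules for the setup message (first message only, then wait for BidiGenerateContentSetupComplete) can be enforced on the client with a small gate. The sketch assumes messages are plain dicts keyed by the message_type member names; `SetupGate` is a hypothetical helper, and treating the "should wait" guidance as a hard error is a choice made for this example.

```python
class SetupGate:
    """Tracks the setup handshake and rejects out-of-order client messages."""

    def __init__(self) -> None:
        self.setup_sent = False
        self.setup_complete = False

    def check_outgoing(self, message: dict) -> None:
        """Raise RuntimeError if sending this client message now breaks the rules."""
        if "setup" in message:
            if self.setup_sent:
                raise RuntimeError("setup is allowed only in the first client message")
            self.setup_sent = True
            return
        if not self.setup_sent:
            raise RuntimeError("the first client message must be setup")
        if not self.setup_complete:
            raise RuntimeError("wait for setup_complete before sending more messages")

    def on_incoming(self, message: dict) -> None:
        """Record the server's setup_complete message when it arrives."""
        if "setup_complete" in message:
            self.setup_complete = True
```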
AudioTranscriptionConfig
This type has no fields.
The audio transcription configuration.
BidiGenerateContentSetupComplete
Sent in response to a BidiGenerateContentSetup message from the client.
| Fields | |
|---|---|
| session_id | 
 Output only. The session id of the session. | 
BidiGenerateContentToolCall
Request for the client to execute the function_calls and return the responses with the matching ids.
| Fields | |
|---|---|
| function_calls[] | Output only. The function call to be executed. | 
BidiGenerateContentToolCallCancellation
Notification for the client that a previously issued ToolCallMessage with the specified ids should not have been executed and should be cancelled. If there were side-effects to those tool calls, clients may attempt to undo the tool calls. This message occurs only in cases where the clients interrupt server turns.
| Fields | |
|---|---|
| ids[] | 
 Output only. The ids of the tool calls to be cancelled. | 
BidiGenerateContentToolResponse
Client generated response to a ToolCall received from the server. Individual FunctionResponse objects are matched to the respective FunctionCall objects by the id field.
Note that in the unary and server-streaming GenerateContent APIs function calling happens by exchanging the Content parts, while in the bidi GenerateContent APIs function calling happens over this dedicated set of messages.
| Fields | |
|---|---|
| function_responses[] | Optional. The response to the function calls. | 
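Matching each FunctionResponse to its FunctionCall by id can be sketched as below. The example assumes the tool call arrives as a dict with a function_calls list whose entries carry id, name, and args keys, and that local functions are registered in a `handlers` mapping; the helper name and the handler registry are hypothetical.

```python
from typing import Callable, Dict


def build_tool_response(tool_call: dict, handlers: Dict[str, Callable]) -> dict:
    """Run each requested function and echo its id back in the response."""
    responses = []
    for call in tool_call.get("function_calls", []):
        result = handlers[call["name"]](**call.get("args", {}))
        responses.append(
            {
                "id": call["id"],  # the id ties this response to its call
                "name": call["name"],
                "response": result,
            }
        )
    return {"function_responses": responses}
```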
BidiQueryReasoningEngineRequest
Request message for [ReasoningEngineExecutionService.BidiQuery][].
| Fields | |
|---|---|
| Union field request_type. The type of the stream request. request_type can be only one of the following: | |
| setup | Setup information for the stream. | 
| bidi_stream_input | Request to be sent to the agent application in JSON object format. Examples include text query, function calling parameters, media bytes, etc. | 
BidiQueryReasoningEngineResponse
Response message for [ReasoningEngineExecutionService.BidiQuery][].
| Fields | |
|---|---|
| bidi_stream_output | Response provided by the agent application in JSON object format. Examples include text query, function calling parameters, media bytes, etc. | 
BidiQueryReasoningEngineSetup
Request to be sent in the first and only first bidi query request.
| Fields | |
|---|---|
| name | 
 Required. The name of the ReasoningEngine resource to use. Format:  | 
| input | Optional. Input content provided by users in JSON object format. Examples include text query, function calling parameters, media bytes, etc. | 
| class_method | 
 Optional. Class method to be used for the bidi query. It is optional and defaults to "bidi_query" if unspecified. | 
BigQueryDestination
The BigQuery location for the output content.
| Fields | |
|---|---|
| output_uri | 
 Required. BigQuery URI to a project or table, up to 2000 characters long. When only the project is specified, the Dataset and Table are created. When the full table reference is specified, the Dataset must exist and the table must not exist. Accepted forms: 
 | 
BigQueryRequestSet
The request set for the evaluation run.
| Fields | |
|---|---|
| uri | 
 Required. The URI of a BigQuery table. e.g. bq://projectId.bqDatasetId.bqTableId | 
| prompt_column | 
 Optional. The name of the column that contains the requests to evaluate. This will be in evaluation_item.EvalPrompt format. | 
| rubrics_column | 
 Optional. The name of the column that contains the rubrics. This is in evaluation_rubric.RubricGroup format. | 
| candidate_response_columns | 
 Optional. Map of candidate name to candidate response column name. The column will be in evaluation_item.CandidateResponse format. | 
| sampling_config | Optional. The sampling config for the bigquery resource. | 
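The table URI form shown for uri (bq://projectId.bqDatasetId.bqTableId) can be split into its parts with a few lines of code. This is a minimal sketch: `parse_bq_table_uri` is a hypothetical helper, and it assumes that none of the three components contains a dot.

```python
from typing import Tuple


def parse_bq_table_uri(uri: str) -> Tuple[str, str, str]:
    """Return (project, dataset, table) from a bq:// table URI."""
    prefix = "bq://"
    if not uri.startswith(prefix):
        raise ValueError(f"expected a URI starting with {prefix!r}: {uri!r}")
    parts = uri[len(prefix):].split(".")
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected bq://project.dataset.table, got {uri!r}")
    return parts[0], parts[1], parts[2]
```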
SamplingConfig
The sampling config.
| Fields | |
|---|---|
| sampling_count | 
 Optional. The total number of logged data to import. If available data is less than the sampling count, all data will be imported. Default is 100. | 
| sampling_method | Optional. The sampling method to use. | 
| sampling_duration | Optional. How long to wait before sampling data from the BigQuery table. If not specified, defaults to 0. | 
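The sampling_count rule (import everything when fewer rows are available, otherwise take a sample, with a default of 100) can be pictured with a local sketch. It only illustrates the documented rule on an in-memory list; how the service actually draws its RANDOM sample is not specified here, and the function name and `seed` parameter are assumptions.

```python
import random
from typing import List, Optional

DEFAULT_SAMPLING_COUNT = 100  # default stated in the field description


def sample_rows(rows: List, sampling_count: Optional[int] = None,
                seed: Optional[int] = None) -> List:
    """Return all rows if there are too few, else a random sample."""
    count = DEFAULT_SAMPLING_COUNT if sampling_count is None else sampling_count
    if len(rows) <= count:
        return list(rows)
    return random.Random(seed).sample(rows, count)
```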
SamplingMethod
The sampling method to use.
| Enums | |
|---|---|
| SAMPLING_METHOD_UNSPECIFIED | Unspecified sampling method. | 
| RANDOM | Random sampling. | 
BigQuerySource
The BigQuery location for the input content.
| Fields | |
|---|---|
| input_uri | 
 Required. BigQuery URI to a table, up to 2000 characters long. Accepted forms: 
 | 
BleuInput
Input for bleu metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for bleu score metric. | 
| instances[] | Required. Repeated bleu instances. | 
BleuInstance
Spec for bleu instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
| reference | 
 Required. Ground truth used to compare against the prediction. | 
BleuMetricValue
Bleu metric value for an instance.
| Fields | |
|---|---|
| score | 
 Output only. Bleu score. | 
BleuResults
Results for bleu metric.
| Fields | |
|---|---|
| bleu_metric_values[] | Output only. Bleu metric values. | 
BleuSpec
Spec for bleu score metric. Calculates the precision of n-grams in the prediction as compared to the reference and returns a score ranging from 0 to 1.
| Fields | |
|---|---|
| use_effective_order | 
 Optional. Whether to use effective order when computing the bleu score. | 
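To make "precision of n-grams in the prediction as compared to reference" concrete, the sketch below computes a clipped n-gram precision for a single n using whitespace tokenization. It is a simplified building block, not the full bleu score (no brevity penalty, no averaging over orders, no effective-order handling), and it is not claimed to match the service's computation.

```python
from collections import Counter


def ngram_precision(prediction: str, reference: str, n: int = 1) -> float:
    """Fraction of prediction n-grams that also occur in the reference.

    Counts are clipped: an n-gram is credited at most as many times
    as it appears in the reference.
    """
    pred = prediction.split()
    ref = reference.split()
    pred_ngrams = Counter(tuple(pred[i:i + n]) for i in range(len(pred) - n + 1))
    ref_ngrams = Counter(tuple(ref[i:i + n]) for i in range(len(ref) - n + 1))
    total = sum(pred_ngrams.values())
    if total == 0:
        return 0.0
    matched = sum(min(count, ref_ngrams[gram]) for gram, count in pred_ngrams.items())
    return matched / total
```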
Blob
Content blob.
| Fields | |
|---|---|
| mime_type | 
 Required. The IANA standard MIME type of the source data. | 
| data | 
 Required. Raw bytes. | 
| display_name | 
 Optional. Display name of the blob. Used to provide a label or filename to distinguish blobs. This field is only returned in PromptMessage for prompt management. It is currently used in the Gemini GenerateContent calls only when server side tools (code_execution, google_search, and url_context) are enabled. | 
CacheConfig
Config of GenAI caching features. This is a singleton resource.
| Fields | |
|---|---|
| name | 
 Identifier. Name of the cache config. Format:  -  | 
| disable_cache | 
 If set to true, disables GenAI caching. Otherwise caching is enabled. | 
CachedContent
A resource used in LLM queries for users to explicitly specify what to cache and how to cache.
| Fields | |
|---|---|
| name | 
 Immutable. Identifier. The server-generated resource name of the cached content. Format: projects/{project}/locations/{location}/cachedContents/{cached_content} | 
| display_name | 
 Optional. Immutable. The user-generated meaningful display name of the cached content. | 
| model | 
 Immutable. The name of the  | 
| system_instruction | Optional. Input only. Immutable. Developer set system instruction. Currently, text only | 
| contents[] | Optional. Input only. Immutable. The content to cache | 
| tools[] | Optional. Input only. Immutable. A list of  | 
| tool_config | Optional. Input only. Immutable. Tool config. This config is shared for all tools | 
| create_time | Output only. Creation time of the cache entry. | 
| update_time | Output only. When the cache entry was last updated in UTC time. | 
| usage_metadata | Output only. Metadata on the usage of the cached content. | 
| encryption_spec | Input only. Immutable. Customer-managed encryption key spec for a  | 
| Union field expiration. Expiration time of the cached content. expiration can be only one of the following: | |
| expire_time | Timestamp of when this resource is considered expired. This is always provided on output, regardless of what was sent on input. | 
| ttl | Input only. The TTL for this resource. The expiration time is computed: now + TTL. | 
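The rule "now + TTL" for deriving the expiration time can be written out directly. The sketch uses Python's standard datetime types in place of the protobuf Timestamp and Duration messages; the function name is hypothetical and the optional `now` parameter exists only to make the example deterministic.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional


def compute_expire_time(ttl: timedelta, now: Optional[datetime] = None) -> datetime:
    """Return the expiration time implied by a TTL: now + TTL (UTC)."""
    if ttl < timedelta(0):
        raise ValueError("ttl must not be negative")
    current = now if now is not None else datetime.now(timezone.utc)
    return current + ttl
```

Because expiration is a union, a request would set either ttl or expire_time, not both.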
UsageMetadata
Metadata on the usage of the cached content.
| Fields | |
|---|---|
| total_token_count | 
 Total number of tokens that the cached content consumes. | 
| text_count | 
 Number of text characters. | 
| image_count | 
 Number of images. | 
| video_duration_seconds | 
 Duration of video in seconds. | 
| audio_duration_seconds | 
 Duration of audio in seconds. | 
CancelEvaluationRunRequest
Request message for EvaluationManagementService.CancelEvaluationRun.
| Fields | |
|---|---|
| name | 
 Required. The name of the EvaluationRun resource to be cancelled. Format:  | 
CancelTuningJobRequest
Request message for GenAiTuningService.CancelTuningJob.
| Fields | |
|---|---|
| name | 
 Required. The name of the TuningJob to cancel. Format:  | 
Candidate
A response candidate generated from the model.
| Fields | |
|---|---|
| index | 
 Output only. Index of the candidate. | 
| content | Output only. Content parts of the candidate. | 
| avg_logprobs | 
 Output only. Average log probability score of the candidate. | 
| logprobs_result | Output only. Log-likelihood scores for the response tokens and top tokens | 
| finish_reason | Output only. The reason why the model stopped generating tokens. If empty, the model has not stopped generating the tokens. | 
| safety_ratings[] | Output only. List of ratings for the safety of a response candidate. There is at most one rating per category. | 
| citation_metadata | Output only. Source attribution of the generated content. | 
| grounding_metadata | Output only. Metadata specifies sources used to ground generated content. | 
| url_context_metadata | Output only. Metadata related to url context retrieval tool. | 
| finish_message | 
 Output only. Describes the reason the model stopped generating tokens in more detail. This is only filled when  | 
FinishReason
The reason why the model stopped generating tokens. If empty, the model has not stopped generating the tokens.
| Enums | |
|---|---|
| FINISH_REASON_UNSPECIFIED | The finish reason is unspecified. | 
| STOP | Token generation reached a natural stopping point or a configured stop sequence. | 
| MAX_TOKENS | Token generation reached the configured maximum output tokens. | 
| SAFETY | Token generation stopped because the content potentially contains safety violations. NOTE: When streaming, content is empty if content filters block the output. | 
| RECITATION | The token generation stopped because of potential recitation. | 
| OTHER | All other reasons that stopped the token generation. | 
| BLOCKLIST | Token generation stopped because the content contains forbidden terms. | 
| PROHIBITED_CONTENT | Token generation stopped for potentially containing prohibited content. | 
| SPII | Token generation stopped because the content potentially contains Sensitive Personally Identifiable Information (SPII). | 
| MALFORMED_FUNCTION_CALL | The function call generated by the model is syntactically invalid (e.g. the function call generated is not parsable). | 
| MODEL_ARMOR | The model response was blocked by Model Armor. | 
| IMAGE_SAFETY | Token generation stopped because generated images have safety violations. | 
| IMAGE_PROHIBITED_CONTENT | Image generation stopped because generated images have other prohibited content. | 
| IMAGE_RECITATION | Image generation stopped due to recitation. | 
| IMAGE_OTHER | Image generation stopped because of other miscellaneous issues. | 
| UNEXPECTED_TOOL_CALL | The function call generated by the model is semantically invalid (e.g. a function call is generated when function calling is not enabled or the function is not in the function declaration). | 
| NO_IMAGE | The model was expected to generate an image, but none was generated. | 
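A caller often needs to collapse these values into a few coarse states. The sketch below assumes a candidate decoded into a dict with the enum rendered as its string name; the function and the four state labels are hypothetical, and treating FINISH_REASON_UNSPECIFIED the same as an empty value is an assumption of this example.

```python
def describe_finish(candidate: dict) -> str:
    """Map a candidate's finish_reason to a coarse, caller-defined state."""
    reason = candidate.get("finish_reason")
    if not reason or reason == "FINISH_REASON_UNSPECIFIED":
        # Empty means the model has not stopped generating tokens.
        return "in_progress"
    if reason == "STOP":
        return "finished"
    if reason == "MAX_TOKENS":
        return "truncated"
    # SAFETY, RECITATION, BLOCKLIST, MALFORMED_FUNCTION_CALL, and the rest.
    return "stopped_early"
```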
CandidateResponse
Responses from model or agent.
| Fields | |
|---|---|
| candidate | 
 Required. The name of the candidate that produced the response. | 
| events[] | Optional. Intermediate events (such as tool calls and responses) that led to the final response. | 
| Union field data. The response from the model or agent. data can be only one of the following: | |
| text | 
 Text response. | 
| value | Fields and values that can be used to populate the response template. | 
CandidateResult
Result for a single candidate.
| Fields | |
|---|---|
| candidate | 
 Required. The candidate that is being evaluated. The value is the same as the candidate name in the EvaluationRequest. | 
| metric | 
 Required. The metric that was evaluated. | 
| explanation | 
 Optional. The explanation for the metric. | 
| rubric_verdicts[] | Optional. The rubric verdicts for the metric. | 
| additional_results | Optional. Additional results for the metric. | 
| Union field result. The result for the metric. result can be only one of the following: | |
| score | 
 Optional. The score for the metric. | 
ChatCompletionsRequest
Request message for [PredictionService.ChatCompletions]
| Fields | |
|---|---|
| endpoint | 
 Required. The name of the endpoint requested to serve the prediction. Format:  | 
| http_body | Optional. The prediction input. Supports HTTP headers and arbitrary data payload. | 
Citation
Source attributions for content.
| Fields | |
|---|---|
| start_index | 
 Output only. Start index into the content. | 
| end_index | 
 Output only. End index into the content. | 
| uri | 
 Output only. Url reference of the attribution. | 
| title | 
 Output only. Title of the attribution. | 
| license | 
 Output only. License of the attribution. | 
| publication_date | Output only. Publication date of the attribution. | 
CitationMetadata
A collection of source attributions for a piece of content.
| Fields | |
|---|---|
| citations[] | Output only. List of citations. | 
Claim
Claim that is extracted from the input text and facts that support it.
| Fields | |
|---|---|
| fact_indexes[] | 
 Indexes of the facts supporting this claim. | 
| start_index | 
 Index in the input text where the claim starts (inclusive). | 
| end_index | 
 Index in the input text where the claim ends (exclusive). | 
| score | 
 Confidence score of this corroboration. | 
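Because start_index is inclusive and end_index is exclusive, the pair maps onto a half-open slice. The following is a minimal sketch, assuming the indexes are character offsets into the input text (the reference does not say whether they count characters or bytes). The Claim dataclass and the sample text are hypothetical and only mirror the field names above.

```python
from dataclasses import dataclass, field


@dataclass
class Claim:
    """Hypothetical local mirror of the Claim message fields."""
    start_index: int  # inclusive
    end_index: int    # exclusive
    fact_indexes: list = field(default_factory=list)
    score: float = 0.0


def claim_text(input_text: str, claim: Claim) -> str:
    # A half-open [start_index, end_index) range is exactly a Python slice.
    return input_text[claim.start_index:claim.end_index]


text = "The sky is blue. Grass is green."
first = Claim(start_index=0, end_index=16, fact_indexes=[0], score=0.9)
second = Claim(start_index=17, end_index=32, fact_indexes=[1], score=0.8)
print(claim_text(text, first))
print(claim_text(text, second))
```

With half-open ranges, end_index - start_index gives the length of the claim directly, and adjacent claims never share a character.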
CodeExecutionResult
Result of executing the [ExecutableCode].
Only generated when using the [CodeExecution] tool, and always follows a part containing the [ExecutableCode].
| Fields | |
|---|---|
| outcome | Required. Outcome of the code execution. | 
| output | 
 Optional. Contains stdout when code execution is successful, stderr or other description otherwise. | 
Outcome
Enumeration of possible outcomes of the code execution.
| Enums | |
|---|---|
| OUTCOME_UNSPECIFIED | Unspecified status. This value should not be used. | 
| OUTCOME_OK | Code execution completed successfully. | 
| OUTCOME_FAILED | Code execution finished but with a failure. stderr should contain the reason. | 
| OUTCOME_DEADLINE_EXCEEDED | Code execution ran for too long, and was cancelled. There may or may not be a partial output present. | 
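A client typically branches on outcome before deciding how to read output. The sketch below is illustrative only: it assumes the outcome arrives as its enum name in string form, and summarize_execution is a hypothetical helper, not part of the API.

```python
def summarize_execution(outcome: str, output: str = "") -> str:
    """Turn a CodeExecutionResult-like (outcome, output) pair into a log line."""
    if outcome == "OUTCOME_OK":
        # On success, output holds stdout.
        return f"ok, stdout: {output}"
    if outcome == "OUTCOME_FAILED":
        # On failure, output holds stderr or another description.
        return f"failed, reason: {output}"
    if outcome == "OUTCOME_DEADLINE_EXCEEDED":
        # Partial output may or may not be present.
        partial = output if output else "<none>"
        return f"cancelled after running too long, partial output: {partial}"
    # OUTCOME_UNSPECIFIED should not be used, so treat it as an error here.
    raise ValueError(f"unexpected outcome: {outcome}")


print(summarize_execution("OUTCOME_OK", "42"))
print(summarize_execution("OUTCOME_DEADLINE_EXCEEDED"))
```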
CoherenceInput
Input for coherence metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for coherence score metric. | 
| instance | Required. Coherence instance. | 
CoherenceInstance
Spec for coherence instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
CoherenceResult
Spec for coherence result.
| Fields | |
|---|---|
| explanation | 
 Output only. Explanation for coherence score. | 
| score | 
 Output only. Coherence score. | 
| confidence | 
 Output only. Confidence for coherence score. | 
CoherenceSpec
Spec for coherence score metric.
| Fields | |
|---|---|
| version | 
 Optional. Which version to use for evaluation. | 
CometInput
Input for Comet metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for comet metric. | 
| instance | Required. Comet instance. | 
CometInstance
Spec for Comet instance - The fields used for evaluation are dependent on the comet version.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
| reference | 
 Optional. Ground truth used to compare against the prediction. | 
| source | 
 Optional. Source text in original language. | 
CometResult
Spec for Comet result - calculates the comet score for the given instance using the version specified in the spec.
| Fields | |
|---|---|
| score | 
 Output only. Comet score. Range depends on version. | 
CometSpec
Spec for Comet metric.
| Fields | |
|---|---|
| source_language | 
 Optional. Source language in BCP-47 format. | 
| target_language | 
 Optional. Target language in BCP-47 format. Covers both prediction and reference. | 
| version | Required. Which version to use for evaluation. | 
CometVersion
Comet version options.
| Enums | |
|---|---|
| COMET_VERSION_UNSPECIFIED | Comet version unspecified. | 
| COMET_22_SRC_REF | Comet 22 for translation + source + reference (source-reference-combined). | 
ComputeTokensRequest
Request message for ComputeTokens RPC call.
| Fields | |
|---|---|
| endpoint | 
 Required. The name of the Endpoint requested to get lists of tokens and token ids. | 
| instances[] | Optional. The instances that are the input to token computing API call. Schema is identical to the prediction schema of the text model, even for the non-text models, like chat models, or Codey models. | 
| model | 
 Optional. The name of the publisher model requested to serve the prediction. Format: projects/{project}/locations/{location}/publishers/*/models/* | 
| contents[] | Optional. Input content. | 
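The model field follows the pattern projects/{project}/locations/{location}/publishers/*/models/*. As a minimal sketch, a helper can assemble that string from its segments. publisher_model_name is a hypothetical function, and the project, location, publisher, and model values are placeholders rather than real resources.

```python
def publisher_model_name(project: str, location: str, publisher: str, model: str) -> str:
    """Build a resource name matching the documented publisher model format."""
    segments = {
        "project": project,
        "location": location,
        "publisher": publisher,
        "model": model,
    }
    for label, value in segments.items():
        # Each segment is a single path component, so it cannot be empty or contain '/'.
        if not value or "/" in value:
            raise ValueError(f"invalid {label}: {value!r}")
    return (
        f"projects/{project}/locations/{location}"
        f"/publishers/{publisher}/models/{model}"
    )


print(publisher_model_name("my-project", "us-central1", "google", "example-model"))
```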
ComputeTokensResponse
Response message for ComputeTokens RPC call.
| Fields | |
|---|---|
| tokens_info[] | Lists of tokens info from the input. A ComputeTokensRequest could have multiple instances with a prompt in each instance. We also need to return lists of tokens info for the request with multiple instances. | 
Content
The base structured datatype containing multi-part content of a message.
A Content includes a role field designating the producer of the Content and a parts field containing multi-part data that contains the content of the message turn.
| Fields | |
|---|---|
| role | 
 Optional. The producer of the content. Must be either 'user' or 'model'. Useful to set for multi-turn conversations, otherwise can be left blank or unset. | 
| parts[] | Required. Ordered  | 
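To make the role and parts relationship concrete, the sketch below builds a multi-turn conversation as plain dictionaries. It assumes each part carries its text under a "text" key; make_content is a hypothetical helper, and the conversation text is invented for illustration.

```python
ALLOWED_ROLES = {"user", "model"}


def make_content(text, role=None):
    """Build a Content-like dict with a single text part."""
    if role is not None and role not in ALLOWED_ROLES:
        raise ValueError(f"role must be 'user' or 'model', got {role!r}")
    content = {"parts": [{"text": text}]}
    # role may be left unset for single-turn requests.
    if role is not None:
        content["role"] = role
    return content


# Multi-turn conversation: roles identify who produced each turn.
conversation = [
    make_content("What is a token?", role="user"),
    make_content("A token is a unit of text the model processes.", role="model"),
    make_content("How many tokens are in this sentence?", role="user"),
]
print([turn["role"] for turn in conversation])
```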
ContentMap
Map of placeholder in metric prompt template to contents of model input.
| Fields | |
|---|---|
| values | 
 Optional. Map of placeholder to contents. | 
Contents
Repeated Content type.
| Fields | |
|---|---|
| contents[] | Optional. Repeated contents. | 
ContextWindowCompressionConfig
Enables context window compression, a mechanism for managing the model context window so that it does not exceed a given length.
| Fields | |
|---|---|
| Union field compression_mechanism. Context window compression mechanism. compression_mechanism can be only one of the following: | |
| sliding_window | Sliding window compression mechanism. | 
| trigger_tokens | 
 Number of tokens (before running a turn) that triggers the context window compression mechanism. This can also be used as a latency/quality knob: shorter windows might run faster. If not set, 80% of the model context window is used, leaving 20% for the next user request/model response. | 
SlidingWindow
The context window is truncated by keeping only its suffix. The context window is always cut at the start of a USER role turn. System instructions and BidiGenerateContentSetup.prefix_turns are not subject to the sliding window mechanism; they always stay at the beginning of the context window.
| Fields | |
|---|---|
| target_tokens | 
 Session reduction target: how many tokens to keep. The window shortening operation has some latency cost, so it should not run on every turn. Should be < trigger_tokens. If not set, trigger_tokens/2 is assumed. | 
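The documented defaults for trigger_tokens and target_tokens can be worked through numerically. The following is a sketch under stated assumptions: resolve_compression is a hypothetical helper, the 32000-token context window is an invented figure, and rounding down with integer division is an assumption, since the reference does not specify rounding.

```python
def resolve_compression(context_window, trigger_tokens=None, target_tokens=None):
    """Return the effective (trigger_tokens, target_tokens) pair."""
    if trigger_tokens is None:
        # Default: 80% of the model context window.
        trigger_tokens = context_window * 80 // 100
    if target_tokens is None:
        # Default: half of trigger_tokens.
        target_tokens = trigger_tokens // 2
    if target_tokens >= trigger_tokens:
        raise ValueError("target_tokens should be < trigger_tokens")
    return trigger_tokens, target_tokens


print(resolve_compression(32000))                        # both defaults
print(resolve_compression(32000, trigger_tokens=10000))  # default target only
```

In the first call, compression starts once the context reaches 25600 tokens and trims it back to 12800, so the truncation cost is paid only occasionally rather than on every turn.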
CorpusStatus
RagCorpus status.
| Fields | |
|---|---|
| state | Output only. RagCorpus life state. | 
| error_status | 
 Output only. Only when the  | 
State
RagCorpus life state.
| Enums | |
|---|---|
| UNKNOWN | This state is not supposed to happen. | 
| INITIALIZED | RagCorpus resource entry is initialized, but hasn't done validation. | 
| ACTIVE | RagCorpus is provisioned successfully and is ready to serve. | 
| ERROR | RagCorpus is in a problematic situation. See error_message field for details. | 
CorroborateContentRequest
Request message for CorroborateContent.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location from which to corroborate text. The users must have permission to make a call in the project. Format:  | 
| facts[] | Optional. Facts used to generate the text can also be used to corroborate the text. | 
| parameters | Optional. Parameters that can be set to override default settings per request. | 
| content | Optional. Input content to corroborate, only text format is supported for now. | 
Parameters
Parameters that can be overridden per request.
| Fields | |
|---|---|
| citation_threshold | 
 Optional. Only return claims with citation score larger than the threshold. | 
CorroborateContentResponse
Response message for CorroborateContent.
| Fields | |
|---|---|
| claims[] | Claims that are extracted from the input content and facts that support the claims. | 
| corroboration_score | 
 Confidence score of corroborating content. Value is in the range [0, 1], where 1 is the most confident. | 
CountTokensRequest
Request message for PredictionService.CountTokens.
| Fields | |
|---|---|
| endpoint | 
 Required. The name of the Endpoint requested to perform token counting. Format:  | 
| model | 
 Optional. The name of the publisher model requested to serve the prediction. Format:  | 
| instances[] | Optional. The instances that are the input to token counting call. Schema is identical to the prediction schema of the underlying model. | 
| contents[] | Optional. Input content. | 
| tools[] | Optional. A list of  A  | 
| system_instruction | Optional. The user provided system instructions for the model. Note: only text should be used in parts and content in each part will be in a separate paragraph. | 
| generation_config | Optional. Generation config that the model will use to generate the response. | 
CountTokensResponse
Response message for PredictionService.CountTokens.
| Fields | |
|---|---|
| total_tokens | 
 The total number of tokens counted across all instances from the request. | 
| total_billable_characters | 
 The total number of billable characters counted across all instances from the request. | 
| prompt_tokens_details[] | Output only. List of modalities that were processed in the request input. | 
CreateCachedContentRequest
Request message for GenAiCacheService.CreateCachedContent.
| Fields | |
|---|---|
| parent | 
 Required. The parent resource where the cached content will be created. | 
| cached_content | Required. The cached content to create. | 
CreateEvaluationItemRequest
Request message for EvaluationManagementService.CreateEvaluationItem.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location to create the Evaluation Item in. Format:  | 
| evaluation_item | Required. The Evaluation Item to create. | 
CreateEvaluationRunRequest
Request message for EvaluationManagementService.CreateEvaluationRun.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location to create the Evaluation Run in. Format:  | 
| evaluation_run | Required. The Evaluation Run to create. | 
CreateEvaluationSetRequest
Request message for EvaluationManagementService.CreateEvaluationSet.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location to create the Evaluation Set in. Format:  | 
| evaluation_set | Required. The Evaluation Set to create. | 
CreateMemoryOperationMetadata
Details of MemoryBankService.CreateMemory operation.
| Fields | |
|---|---|
| generic_metadata | The common part of the operation metadata. | 
CreateMemoryRequest
Request message for MemoryBankService.CreateMemory.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the ReasoningEngine to create the Memory under. Format:  | 
| memory | Required. The Memory to be created. | 
CreateRagCorpusOperationMetadata
Runtime operation information for VertexRagDataService.CreateRagCorpus.
| Fields | |
|---|---|
| generic_metadata | The operation generic information. | 
CreateRagCorpusRequest
Request message for VertexRagDataService.CreateRagCorpus.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location to create the RagCorpus in. Format:  | 
| rag_corpus | Required. The RagCorpus to create. | 
CreateReasoningEngineOperationMetadata
Details of ReasoningEngineService.CreateReasoningEngine operation.
| Fields | |
|---|---|
| generic_metadata | The common part of the operation metadata. | 
CreateReasoningEngineRequest
Request message for ReasoningEngineService.CreateReasoningEngine.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location to create the ReasoningEngine in. Format:  | 
| reasoning_engine | Required. The ReasoningEngine to create. | 
CreateSessionOperationMetadata
Metadata associated with the SessionService.CreateSession operation.
| Fields | |
|---|---|
| generic_metadata | The common part of the operation metadata. | 
CreateSessionRequest
Request message for SessionService.CreateSession.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the location to create the session in. Format:  | 
| session | Required. The session to create. | 
CreateTuningJobRequest
Request message for GenAiTuningService.CreateTuningJob.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location to create the TuningJob in. Format:  | 
| tuning_job | Required. The TuningJob to create. | 
CustomOutput
Spec for custom output.
| Fields | |
|---|---|
| Union field custom_output. Custom output. custom_output can be only one of the following: | |
| raw_outputs | Output only. List of raw output strings. | 
CustomOutputFormatConfig
Spec for custom output format configuration.
| Fields | |
|---|---|
| Union field custom_output_format_config. Custom output format configuration. custom_output_format_config can be only one of the following: | |
| return_raw_output | 
 Optional. Whether to return raw output. | 
DatasetDistribution
Distribution computed over a tuning dataset.
| Fields | |
|---|---|
| sum | 
 Output only. Sum of a given population of values. | 
| min | 
 Output only. The minimum of the population values. | 
| max | 
 Output only. The maximum of the population values. | 
| mean | 
 Output only. The arithmetic mean of the values in the population. | 
| median | 
 Output only. The median of the values in the population. | 
| p5 | 
 Output only. The 5th percentile of the values in the population. | 
| p95 | 
 Output only. The 95th percentile of the values in the population. | 
| buckets[] | Output only. Defines the histogram bucket. | 
DistributionBucket
Dataset bucket used to create a histogram for the distribution given a population of values.
| Fields | |
|---|---|
| count | 
 Output only. Number of values in the bucket. | 
| left | 
 Output only. Left bound of the bucket. | 
| right | 
 Output only. Right bound of the bucket. | 
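The DatasetDistribution fields are standard summary statistics plus a histogram. The sketch below shows how such a summary could be computed locally for a list of per-example token counts. It is not the service's implementation: the percentile interpolation method, the equal-width buckets, and the choice to count the maximum value in the last bucket are all assumptions, and summarize and Bucket are hypothetical names.

```python
import statistics
from dataclasses import dataclass


@dataclass
class Bucket:
    """Hypothetical local mirror of DistributionBucket."""
    count: int
    left: float
    right: float


def summarize(values, num_buckets=4):
    if len(values) < 2:
        raise ValueError("need at least two values")
    ordered = sorted(values)
    # 19 cut points at 5%, 10%, ..., 95%; the first and last are p5 and p95.
    cuts = statistics.quantiles(ordered, n=20, method="inclusive")
    low, high = ordered[0], ordered[-1]
    # Equal-width buckets; fall back to width 1.0 if all values are identical.
    width = (high - low) / num_buckets or 1.0
    buckets = [
        Bucket(0, low + i * width, low + (i + 1) * width)
        for i in range(num_buckets)
    ]
    for value in ordered:
        # Clamp so the maximum value lands in the last bucket.
        index = min(int((value - low) / width), num_buckets - 1)
        buckets[index].count += 1
    return {
        "sum": sum(ordered),
        "min": low,
        "max": high,
        "mean": statistics.mean(ordered),
        "median": statistics.median(ordered),
        "p5": cuts[0],
        "p95": cuts[-1],
        "buckets": buckets,
    }


stats = summarize([1, 2, 3, 4, 5, 6, 7, 8, 9])
print(stats["sum"], stats["mean"], stats["median"])
print([(b.left, b.right, b.count) for b in stats["buckets"]])
```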
DatasetStats
Statistics computed over a tuning dataset.
| Fields | |
|---|---|
| tuning_dataset_example_count | 
 Output only. Number of examples in the tuning dataset. | 
| total_tuning_character_count | 
 Output only. Number of tuning characters in the tuning dataset. | 
| total_billable_character_count | 
 Output only. Number of billable characters in the tuning dataset. | 
| tuning_step_count | 
 Output only. Number of tuning steps for this Tuning Job. | 
| user_input_token_distribution | Output only. Dataset distributions for the user input tokens. | 
| user_message_per_example_distribution | Output only. Dataset distributions for the messages per example. | 
| user_dataset_examples[] | Output only. Sample user messages in the training dataset uri. | 
| dropped_example_indices[] | 
 Output only. A partial sample of the indices (starting from 1) of the dropped examples. | 
| dropped_example_reasons[] | 
 Output only. For each index in  | 
| user_output_token_distribution | Output only. Dataset distributions for the user output tokens. | 
DeleteCachedContentRequest
Request message for GenAiCacheService.DeleteCachedContent.
| Fields | |
|---|---|
| name | 
 Required. The resource name referring to the cached content. | 
DeleteEvaluationItemRequest
Request message for EvaluationManagementService.DeleteEvaluationItem.
| Fields | |
|---|---|
| name | 
 Required. The name of the EvaluationItem resource to be deleted. Format:  | 
DeleteEvaluationRunOperationMetadata
Operation metadata for EvaluationManagementService.DeleteEvaluationRun.
| Fields | |
|---|---|
| generic_metadata | Generic operation metadata. | 
DeleteEvaluationRunRequest
Request message for EvaluationManagementService.DeleteEvaluationRun.
| Fields | |
|---|---|
| name | 
 Required. The name of the EvaluationRun resource to be deleted. Format:  | 
DeleteEvaluationSetRequest
Request message for EvaluationManagementService.DeleteEvaluationSet.
| Fields | |
|---|---|
| name | 
 Required. The name of the EvaluationSet resource to be deleted. Format:  | 
DeleteExtensionRequest
Request message for ExtensionRegistryService.DeleteExtension.
| Fields | |
|---|---|
| name | 
 Required. The name of the Extension resource to be deleted. Format:  | 
DeleteMemoryOperationMetadata
Details of MemoryBankService.DeleteMemory operation.
| Fields | |
|---|---|
| generic_metadata | The common part of the operation metadata. | 
DeleteMemoryRequest
Request message for MemoryBankService.DeleteMemory.
| Fields | |
|---|---|
| name | 
 Required. The resource name of the Memory to delete. Format:  | 
DeleteOperationMetadata
Details of operations that perform deletes of any entities.
| Fields | |
|---|---|
| generic_metadata | The common part of the operation metadata. | 
DeleteRagCorpusRequest
Request message for VertexRagDataService.DeleteRagCorpus.
| Fields | |
|---|---|
| name | 
 Required. The name of the RagCorpus resource to be deleted. Format:  | 
| force | 
 Optional. If set to true, any RagFiles in this RagCorpus will also be deleted. Otherwise, the request will only work if the RagCorpus has no RagFiles. | 
DeleteRagFileRequest
Request message for VertexRagDataService.DeleteRagFile.
| Fields | |
|---|---|
| name | 
 Required. The name of the RagFile resource to be deleted. Format:  | 
| force_delete | 
 Optional. If set to true, any errors generated by external vector database during the deletion will be ignored. The default value is false. | 
DeleteReasoningEngineRequest
Request message for ReasoningEngineService.DeleteReasoningEngine.
| Fields | |
|---|---|
| name | 
 Required. The name of the ReasoningEngine resource to be deleted. Format:  | 
| force | 
 Optional. If set to true, child resources of this reasoning engine will also be deleted. Otherwise, the request will fail with FAILED_PRECONDITION error when the reasoning engine has undeleted child resources. | 
DeleteSessionRequest
Request message for SessionService.DeleteSession.
| Fields | |
|---|---|
| name | 
 Required. The resource name of the session. Format:  | 
DirectUploadSource
This type has no fields.
The input content is encapsulated and uploaded in the request.
DistillationDataStats
Statistics computed for datasets used for distillation.
| Fields | |
|---|---|
| training_dataset_stats | Output only. Statistics computed for the training dataset. | 
DistillationHyperParameters
Hyperparameters for Distillation.
| Fields | |
|---|---|
| adapter_size | Optional. Adapter size for distillation. | 
| epoch_count | 
 Optional. Number of complete passes the model makes over the entire training dataset during training. | 
| learning_rate_multiplier | 
 Optional. Multiplier for adjusting the default learning rate. | 
DistillationSpec
Tuning Spec for Distillation.
| Fields | |
|---|---|
| training_dataset_uri | 
 Deprecated. Cloud Storage path to file containing training dataset for tuning. The dataset must be formatted as a JSONL file. | 
| hyper_parameters | Optional. Hyperparameters for Distillation. | 
| student_model | 
 The student model that is being tuned, e.g., "google/gemma-2b-1.1-it". Deprecated. Use base_model instead. | 
| pipeline_root_directory | 
 Deprecated. A path in a Cloud Storage bucket, which will be treated as the root output directory of the distillation pipeline. It is used by the system to generate the paths of output artifacts. | 
| Union field teacher_model. The teacher model that is being distilled from. See Supported models. teacher_model can be only one of the following: | |
| base_teacher_model | 
 The base teacher model that is being distilled. See Supported models. | 
| tuned_teacher_model_source | 
 The resource name of the Tuned teacher model. Format:  | 
| validation_dataset_uri | 
 Optional. Cloud Storage path to file containing validation dataset for tuning. The dataset must be formatted as a JSONL file. | 
DnsPeeringConfig
DNS peering configuration. These configurations are used to create DNS peering zones in the Vertex tenant project VPC, enabling resolution of records within the specified domain hosted in the target network's Cloud DNS.
| Fields | |
|---|---|
| domain | 
 Required. The DNS name suffix of the zone being peered to, e.g., "my-internal-domain.corp.". Must end with a dot. | 
| target_project | 
 Required. The project ID hosting the Cloud DNS managed zone that contains the 'domain'. The Vertex AI Service Agent requires the dns.peer role on this project. | 
| target_network | 
 Required. The VPC network name in the target_project where the DNS zone specified by 'domain' is visible. | 
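The trailing-dot requirement on domain is easy to miss, so a client-side check before sending the request can help. This is a minimal sketch: validate_dns_peering is a hypothetical helper, the project and network names are placeholders, and the only rules it enforces are that all three fields are required and that domain must end with a dot.

```python
def validate_dns_peering(domain: str, target_project: str, target_network: str) -> dict:
    """Check the documented constraints and return a DnsPeeringConfig-like dict."""
    if not domain or not target_project or not target_network:
        raise ValueError("domain, target_project and target_network are all required")
    if not domain.endswith("."):
        raise ValueError(f"domain must end with a dot: {domain!r}")
    return {
        "domain": domain,
        "target_project": target_project,
        "target_network": target_network,
    }


config = validate_dns_peering("my-internal-domain.corp.", "my-dns-project", "my-vpc")
print(config)
```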
DynamicRetrievalConfig
Describes the options to customize dynamic retrieval.
| Fields | |
|---|---|
| mode | The mode of the predictor to be used in dynamic retrieval. | 
| dynamic_threshold | 
 Optional. The threshold to be used in dynamic retrieval. If not set, a system default value is used. | 
Mode
The mode of the predictor to be used in dynamic retrieval.
| Enums | |
|---|---|
| MODE_UNSPECIFIED | Always trigger retrieval. | 
| MODE_DYNAMIC | Run retrieval only when system decides it is necessary. | 
EmbedContentRequest
Request message for PredictionService.EmbedContent.
| Fields | |
|---|---|
| model | 
 Required. The name of the publisher model requested to serve the prediction. Format:  | 
| content | Required. Input content to be embedded. | 
| title | 
 Optional. A title for the text. | 
| task_type | Optional. The task type of the embedding. | 
| output_dimensionality | 
 Optional. Reduced dimension for the output embedding. If set, excess values in the output embedding are truncated from the end. | 
| auto_truncate | 
 Optional. Whether to silently truncate the input content if it's longer than the maximum sequence length. | 
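The effect of output_dimensionality can be illustrated with a plain list: values beyond the requested dimension are dropped from the end. The sketch below only models that truncation on the client side; reduce_embedding is a hypothetical helper and the vector values are invented.

```python
def reduce_embedding(values, output_dimensionality=None):
    """Keep the first output_dimensionality values, dropping the rest from the end."""
    if output_dimensionality is None:
        return list(values)
    if output_dimensionality <= 0:
        raise ValueError("output_dimensionality must be positive")
    return list(values[:output_dimensionality])


full = [0.12, -0.53, 0.88, 0.07, -0.31, 0.44]
print(reduce_embedding(full, 4))
```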
EmbeddingTaskType
Represents a downstream task the embeddings will be used for.
| Enums | |
|---|---|
| UNSPECIFIED | Unset value, which will default to one of the other enum values. | 
| RETRIEVAL_QUERY | Specifies the given text is a query in a search/retrieval setting. | 
| RETRIEVAL_DOCUMENT | Specifies the given text is a document from the corpus being searched. | 
| SEMANTIC_SIMILARITY | Specifies the given text will be used for semantic textual similarity (STS). | 
| CLASSIFICATION | Specifies that the given text will be classified. | 
| CLUSTERING | Specifies that the embeddings will be used for clustering. | 
| QUESTION_ANSWERING | Specifies that the embeddings will be used for question answering. | 
| FACT_VERIFICATION | Specifies that the embeddings will be used for fact verification. | 
| CODE_RETRIEVAL_QUERY | Specifies that the embeddings will be used for code retrieval. | 
EmbedContentResponse
Response message for PredictionService.EmbedContent.
| Fields | |
|---|---|
| embedding | The embedding generated from the input content. | 
| usage_metadata | Metadata about the response(s). | 
| truncated | 
 Whether the input content was truncated before generating the embedding. | 
Embedding
A list of floats representing an embedding.
| Fields | |
|---|---|
| values[] | 
 Embedding vector values. | 
EncryptionSpec
Represents a customer-managed encryption key spec that can be applied to a top-level resource.
| Fields | |
|---|---|
| kms_key_name | 
 Required. The Cloud KMS resource identifier of the customer managed encryption key used to protect a resource. Has the form:  | 
EnterpriseWebSearch
Tool to search public web data, powered by Vertex AI Search and Sec4 compliance.
| Fields | |
|---|---|
| exclude_domains[] | 
 Optional. List of domains to be excluded from the search results. The default limit is 2000 domains. | 
| blocking_confidence | Optional. Sites with a confidence level at or above the chosen value are blocked from the search results. | 
EnvVar
Represents an environment variable present in a Container or Python Module.
| Fields | |
|---|---|
| name | 
 Required. Name of the environment variable. Must be a valid C identifier. | 
| value | 
 Required. Variables that reference a $(VAR_NAME) are expanded using the previously defined environment variables in the container and any service environment variables. If a variable cannot be resolved, the reference in the input string is left unchanged. The $(VAR_NAME) syntax can be escaped with a double $$, i.e. $$(VAR_NAME). Escaped references are never expanded, regardless of whether the variable exists. | 
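The expansion rules for value can be approximated locally. The sketch below is one possible reading of those rules, not the service's implementation: expand and resolve_all are hypothetical helpers, the regular expression assumes variable names are C identifiers (as required for name), and inputs with more than two consecutive $ characters are not given any special treatment.

```python
import re

# Matches $(NAME) and the escaped form $$(NAME); NAME is a C identifier.
_REFERENCE = re.compile(r"\$(\$?)\(([A-Za-z_][A-Za-z0-9_]*)\)")


def expand(value, known):
    """Expand $(NAME) references in value using the mapping known."""
    def replace(match):
        escaped, name = match.group(1), match.group(2)
        if escaped:
            # $$(NAME) is never expanded; it yields the literal $(NAME).
            return f"$({name})"
        # Unresolvable references are left unchanged.
        return known.get(name, match.group(0))
    return _REFERENCE.sub(replace, value)


def resolve_all(env_vars, service_env=None):
    """Resolve (name, value) pairs in order; each may reference earlier ones."""
    known = dict(service_env or {})
    for name, value in env_vars:
        known[name] = expand(value, known)
    return known


resolved = resolve_all([
    ("BASE", "/opt/app"),
    ("BIN", "$(BASE)/bin"),        # expanded from a previously defined variable
    ("LITERAL", "$$(BASE)"),       # escaped, stays a literal reference
    ("LATER", "$(UNDEFINED)/x"),   # cannot be resolved, left unchanged
])
print(resolved)
```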
EvaluateDatasetOperationMetadata
Operation metadata for Dataset Evaluation.
| Fields | |
|---|---|
| generic_metadata | Generic operation metadata. | 
EvaluateDatasetRequest
Request message for EvaluationService.EvaluateDataset.
| Fields | |
|---|---|
| location | 
 Required. The resource name of the Location to evaluate the dataset. Format:  | 
| dataset | Required. The dataset used for evaluation. | 
| metrics[] | Required. The metrics used for evaluation. | 
| output_config | Required. Config for evaluation output. | 
| autorater_config | Optional. Autorater config used for evaluation. Currently only publisher Gemini models are supported. Format:  | 
EvaluateDatasetResponse
Response in LRO for EvaluationService.EvaluateDataset.
| Fields | |
|---|---|
| aggregation_output | Output only. Aggregation statistics derived from results of EvaluationService.EvaluateDataset. | 
| output_info | Output only. Output info for EvaluationService.EvaluateDataset. | 
EvaluateDatasetRun
Evaluate Dataset Run Result for Tuning Job.
| Fields | |
|---|---|
| operation_name | 
 Output only. The operation ID of the evaluation run. Format:  | 
| checkpoint_id | 
 Output only. The checkpoint id used in the evaluation run. Only populated when evaluating checkpoints. | 
| evaluate_dataset_response | Output only. Results for EvaluationService.EvaluateDataset. | 
| error | Output only. The error of the evaluation run if any. | 
EvaluateInstancesRequest
Request message for EvaluationService.EvaluateInstances.
| Fields | |
|---|---|
| location | 
 Required. The resource name of the Location to evaluate the instances. Format:  | 
| metrics[] | The metrics used for evaluation. Currently, we only support evaluating a single metric. If multiple metrics are provided, only the first one will be evaluated. | 
| instance | The instance to be evaluated. | 
| autorater_config | Optional. Autorater config used for evaluation. | 
| Union field metric_inputs. Instances and specs for evaluation. metric_inputs can be only one of the following: | |
| exact_match_input | Auto metric instances. Instances and metric spec for exact match metric. | 
| bleu_input | Instances and metric spec for bleu metric. | 
| rouge_input | Instances and metric spec for rouge metric. | 
| fluency_input | LLM-based metric instance. General text generation metrics, applicable to other categories. Input for fluency metric. | 
| coherence_input | Input for coherence metric. | 
| safety_input | Input for safety metric. | 
| groundedness_input | Input for groundedness metric. | 
| fulfillment_input | Input for fulfillment metric. | 
| summarization_quality_input | Input for summarization quality metric. | 
| pairwise_summarization_quality_input | 
 Input for pairwise summarization quality metric. | 
| summarization_helpfulness_input | Input for summarization helpfulness metric. | 
| summarization_verbosity_input | Input for summarization verbosity metric. | 
| question_answering_quality_input | Input for question answering quality metric. | 
| pairwise_question_answering_quality_input | 
 Input for pairwise question answering quality metric. | 
| question_answering_relevance_input | 
 Input for question answering relevance metric. | 
| question_answering_helpfulness_input | 
 Input for question answering helpfulness metric. | 
| question_answering_correctness_input | 
 Input for question answering correctness metric. | 
| pointwise_metric_input | Input for pointwise metric. | 
| pairwise_metric_input | Input for pairwise metric. | 
| tool_call_valid_input | Tool call metric instances. Input for tool call valid metric. | 
| tool_name_match_input | Input for tool name match metric. | 
| tool_parameter_key_match_input | Input for tool parameter key match metric. | 
| tool_parameter_kv_match_input | Input for tool parameter key value match metric. | 
| comet_input | Translation metrics. Input for Comet metric. | 
| metricx_input | Input for Metricx metric. | 
| trajectory_exact_match_input | Input for trajectory exact match metric. | 
| trajectory_in_order_match_input | Input for trajectory in order match metric. | 
| trajectory_any_order_match_input | Input for trajectory any order match metric. | 
| trajectory_precision_input | Input for trajectory precision metric. | 
| trajectory_recall_input | Input for trajectory recall metric. | 
| trajectory_single_tool_use_input | Input for trajectory single tool use metric. | 
| rubric_based_instruction_following_input | 
 Rubric Based Instruction Following metric. | 
EvaluateInstancesResponse
Response message for EvaluationService.EvaluateInstances.
| Fields | |
|---|---|
| metric_results[] | Metric results for each instance. The order of the metric results is guaranteed to be the same as the order of the instances in the request. | 
| Union field evaluation_results. Evaluation results will be served in the same order as presented in EvaluationRequest.instances. evaluation_results can be only one of the following: | |
| exact_match_results | Auto metric evaluation results. Results for exact match metric. | 
| bleu_results | Results for bleu metric. | 
| rouge_results | Results for rouge metric. | 
| fluency_result | LLM-based metric evaluation result. General text generation metrics, applicable to other categories. Result for fluency metric. | 
| coherence_result | Result for coherence metric. | 
| safety_result | Result for safety metric. | 
| groundedness_result | Result for groundedness metric. | 
| fulfillment_result | Result for fulfillment metric. | 
| summarization_quality_result | Summarization only metrics. Result for summarization quality metric. | 
| pairwise_summarization_quality_result | 
 Result for pairwise summarization quality metric. | 
| summarization_helpfulness_result | 
 Result for summarization helpfulness metric. | 
| summarization_verbosity_result | Result for summarization verbosity metric. | 
| question_answering_quality_result | 
 Question answering only metrics. Result for question answering quality metric. | 
| pairwise_question_answering_quality_result | 
 Result for pairwise question answering quality metric. | 
| question_answering_relevance_result | 
 Result for question answering relevance metric. | 
| question_answering_helpfulness_result | 
 Result for question answering helpfulness metric. | 
| question_answering_correctness_result | 
 Result for question answering correctness metric. | 
| pointwise_metric_result | Generic metrics. Result for pointwise metric. | 
| pairwise_metric_result | Result for pairwise metric. | 
| tool_call_valid_results | Tool call metrics. Results for tool call valid metric. | 
| tool_name_match_results | Results for tool name match metric. | 
| tool_parameter_key_match_results | Results for tool parameter key match metric. | 
| tool_parameter_kv_match_results | Results for tool parameter key value match metric. | 
| comet_result | Translation metrics. Result for Comet metric. | 
| metricx_result | Result for Metricx metric. | 
| trajectory_exact_match_results | Result for trajectory exact match metric. | 
| trajectory_in_order_match_results | Result for trajectory in order match metric. | 
| trajectory_any_order_match_results | 
 Result for trajectory any order match metric. | 
| trajectory_precision_results | Result for trajectory precision metric. | 
| trajectory_recall_results | Results for trajectory recall metric. | 
| trajectory_single_tool_use_results | 
 Results for trajectory single tool use metric. | 
| rubric_based_instruction_following_result | 
 Result for rubric based instruction following metric. | 
EvaluationConfig
Evaluation Config for Tuning Job.
| Fields | |
|---|---|
| metrics[] | Required. The metrics used for evaluation. | 
| output_config | Required. Config for evaluation output. | 
| autorater_config | Optional. Autorater config for evaluation. | 
EvaluationDataset
The dataset used for evaluation.
| Fields | |
|---|---|
| Union field source. The source of the dataset. source can be only one of the following: | |
| gcs_source | Cloud storage source holds the dataset. Currently only one Cloud Storage file path is supported. | 
| bigquery_source | BigQuery source holds the dataset. | 
EvaluationInstance
A single instance to be evaluated. Instances are used to specify the input data for evaluation, from simple string comparisons to complex, multi-turn model evaluations.
| Fields | |
|---|---|
| prompt | Optional. Data used to populate placeholder  | 
| rubric_groups | 
 Optional. Named groups of rubrics associated with the prompt. This is used for rubric-based evaluations where rubrics can be referenced by a key. The key could represent versions, associated metrics, etc. | 
| response | Required. Data used to populate placeholder  | 
| reference | Optional. Data used to populate placeholder  | 
| other_data | Optional. Other data used to populate placeholders based on their key. | 
| agent_data | Optional. Data used for agent evaluation. | 
AgentConfig
Configuration for an Agent.
| Fields | |
|---|---|
| developer_instruction | Optional. A field containing instructions from the developer for the agent. | 
| Union field tools_data. Data for the tools available to the agent. tools_data can be only one of the following: | |
| tools_text | 
 A JSON string containing a list of tools available to an agent with info such as name, description, parameters and required parameters. | 
| tools | List of tools. | 
Tools
Represents a list of tools for an agent.
| Fields | |
|---|---|
| tool[] | Optional. List of tools: each tool can have multiple function declarations. | 
AgentData
Contains data specific to agent evaluations.
| Fields | |
|---|---|
| developer_instruction | Optional. A field containing instructions from the developer for the agent. | 
| agent_config | Optional. Agent configuration. | 
| Union field tools_data. Data for the tools available to the agent. tools_data can be only one of the following: | |
| tools_text | 
 A JSON string containing a list of tools available to an agent with info such as name, description, parameters and required parameters. Example: [ { "name": "search_actors", "description": "Search for actors in a movie. Returns a list of actors, their roles, their birthdate, and their place of birth.", "parameters": [ { "name": "movie_name", "description": "The name of the movie." }, { "name": "character_name", "description": "The name of the character." } ], "required": ["movie_name", "character_name"] } ] | 
| tools | List of tools. | 
| Union field events_data. The sequence of function calls and function responses that form the agent's trajectory. events_data can be only one of the following: | |
| events_text | 
 A JSON string containing a sequence of events. | 
| events | A list of events. | 
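The tools_text field carries the tool list as a JSON string rather than as structured messages. A minimal sketch of building that string, assuming a plain Python dict is the starting point; the tool definition mirrors the example in the field description, and the agent_data dict is only an illustrative stand-in for the AgentData message, not a client library type.

```python
import json

# Tool list mirroring the example given for tools_text.
tools = [
    {
        "name": "search_actors",
        "description": "Search for actors in a movie.",
        "parameters": [
            {"name": "movie_name", "description": "The name of the movie."},
            {"name": "character_name", "description": "The name of the character."},
        ],
        "required": ["movie_name", "character_name"],
    }
]

# tools_text and tools belong to the same union (tools_data),
# so only one of them is set. Here the JSON string form is used.
agent_data = {"tools_text": json.dumps(tools)}

# Round trip: the string decodes back to the original list.
decoded = json.loads(agent_data["tools_text"])
```

Serializing with json.dumps avoids hand-written quoting mistakes in the embedded string.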
Events
Represents a list of events for an agent.
| Fields | |
|---|---|
| event[] | Optional. A list of events. | 
Tools
Represents a list of tools for an agent.
| Fields | |
|---|---|
| tool[] | Optional. List of tools: each tool can have multiple function declarations. | 
InstanceData
Instance data used to populate placeholders in a metric prompt template.
| Fields | |
|---|---|
| Union field data. Supported formats for instance data. data can be only one of the following: | |
| text | 
 Text data. | 
| contents | List of Gemini content data. | 
Contents
List of standard Content messages from Gemini API.
| Fields | |
|---|---|
| contents[] | Optional. Repeated contents. | 
MapInstance
Instance data specified as a map.
| Fields | |
|---|---|
| map_instance | 
 Optional. Map of instance data. | 
EvaluationItem
EvaluationItem is a single evaluation request or result. The content of an EvaluationItem is immutable - it cannot be updated once created. EvaluationItems can be deleted when no longer needed.
| Fields | |
|---|---|
| name | 
 Identifier. The resource name of the EvaluationItem. Format:  | 
| display_name | 
 Required. The display name of the EvaluationItem. | 
| metadata | Optional. Metadata for the EvaluationItem. | 
| labels | 
 Optional. Labels for the EvaluationItem. | 
| evaluation_item_type | Required. The type of the EvaluationItem. | 
| create_time | Output only. Timestamp when this item was created. | 
| error | Output only. Error for the evaluation item. | 
| Union field payload. The request or response for the EvaluationItem. payload can be only one of the following: | |
| evaluation_request | The request to evaluate. | 
| evaluation_response | Output only. The response from evaluation. | 
| gcs_uri | 
 The Cloud Storage object where the request or response is stored. | 
EvaluationItemType
The type of the EvaluationItem.
| Enums | |
|---|---|
| EVALUATION_ITEM_TYPE_UNSPECIFIED | The default value. This value is unused. | 
| REQUEST | The EvaluationItem is a request to evaluate. | 
| RESULT | The EvaluationItem is the result of evaluation. | 
EvaluationPrompt
Prompt to be evaluated.
| Fields | |
|---|---|
| Union field data. Prompt can be in one of the following formats. data can be only one of the following: | |
| text | 
 Text prompt. | 
| value | Fields and values that can be used to populate the prompt template. | 
| prompt_template_data | Prompt template data. | 
PromptTemplateData
Message to hold a prompt template and the values to populate the template.
| Fields | |
|---|---|
| values | 
 The values for fields in the prompt template. | 
EvaluationRequest
Single evaluation request.
| Fields | |
|---|---|
| prompt | Required. The request/prompt to evaluate. | 
| golden_response | Optional. The ideal response or ground truth. | 
| rubrics | 
 Optional. Named groups of rubrics associated with this prompt. The key is a user-defined name for the rubric group. | 
| candidate_responses[] | Optional. Responses from model under test and other baseline models for comparison. | 
EvaluationResult
Evaluation result.
| Fields | |
|---|---|
| evaluation_request | 
 Required. The request item that was evaluated. Format: projects/{project}/locations/{location}/evaluationItems/{evaluation_item} | 
| evaluation_run | 
 Required. The evaluation run that was used to generate the result. Format: projects/{project}/locations/{location}/evaluationRuns/{evaluation_run} | 
| request | Required. The request that was evaluated. | 
| metric | 
 Required. The metric that was evaluated. | 
| candidate_results[] | Optional. The results for the metric. | 
| metadata | Optional. Metadata about the evaluation result. | 
EvaluationResults
The results of the evaluation run.
| Fields | |
|---|---|
| summary_metrics | Optional. The summary metrics for the evaluation run. | 
| evaluation_set | 
 The evaluation set where item level results are stored. | 
EvaluationRubricConfig
Configuration for a rubric group to be generated/saved for evaluation.
| Fields | |
|---|---|
| rubric_group_key | 
 Required. The key used to save the generated rubrics. If a generation spec is provided, this key will be used for the name of the generated rubric group. Otherwise, this key will be used to look up the existing rubric group on the evaluation item. Note that if a rubric group key is specified on both a rubric config and an evaluation metric, the key from the metric will be used to select the rubrics for evaluation. | 
| Union field generation_config. The configuration for generating rubrics. generation_config can be only one of the following: | |
| rubric_generation_spec | Dynamically generate rubrics using this specification. | 
| predefined_rubric_generation_spec | Dynamically generate rubrics using a predefined spec. | 
EvaluationRun
EvaluationRun is a resource that represents a single evaluation run, which includes a set of prompts, model responses, evaluation configuration and the resulting metrics.
| Fields | |
|---|---|
| name | 
 Identifier. The resource name of the EvaluationRun. This is a unique identifier. Format:  | 
| display_name | 
 Required. The display name of the Evaluation Run. | 
| metadata | Optional. Metadata about the evaluation run, can be used by the caller to store additional tracking information about the evaluation run. | 
| labels | 
 Optional. Labels for the evaluation run. | 
| data_source | Required. The data source for the evaluation run. | 
| inference_configs | 
 Optional. The candidate to inference config map for the evaluation run. The candidate can be up to 128 characters long and can consist of any UTF-8 characters. | 
| evaluation_config | Required. The configuration used for the evaluation. | 
| state | Output only. The state of the evaluation run. | 
| error | Output only. Only populated when the evaluation run's state is FAILED or CANCELLED. | 
| evaluation_results | Output only. The results of the evaluation run. Only populated when the evaluation run's state is SUCCEEDED. | 
| create_time | Output only. Time when the evaluation run was created. | 
| completion_time | Output only. Time when the evaluation run was completed. | 
| evaluation_set_snapshot | 
 Output only. The specific evaluation set of the evaluation run. For runs with an evaluation set input, this will be that same set. For runs with BigQuery input, it's the sampled BigQuery dataset. | 
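The error and evaluation_results fields are each populated only in particular states. A small sketch of that rule as a lookup, using only the state names and conditions listed for EvaluationRun; the function name is hypothetical and the states are represented as plain strings for illustration.

```python
from typing import Optional


def populated_output_field(state: str) -> Optional[str]:
    """Return which output-only result field is populated for a given state.

    SUCCEEDED            -> evaluation_results
    FAILED or CANCELLED  -> error
    any other state      -> neither field
    """
    if state == "SUCCEEDED":
        return "evaluation_results"
    if state in ("FAILED", "CANCELLED"):
        return "error"
    return None
```

A caller polling a run could use this to decide which field to read once the state stops changing.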
DataSource
The data source for the evaluation run.
| Fields | |
|---|---|
| Union field source. One of multiple supported sources. source can be only one of the following: | |
| evaluation_set | 
 The EvaluationSet resource name. Format:  | 
| bigquery_request_set | Evaluation data in BigQuery. | 
EvaluationConfig
The evaluation configuration used for the evaluation run.
| Fields | |
|---|---|
| metrics[] | Required. The metrics to be calculated in the evaluation run. | 
| rubric_configs[] | Optional. The rubric configs for the evaluation run. They are used to generate rubrics which can be used by rubric-based metrics. Multiple rubric configs can be specified for rubric generation but only one rubric config can be used for a rubric-based metric. If more than one rubric config is provided, the evaluation metric must specify a rubric group key. Note that if a generation spec is specified on both a rubric config and an evaluation metric, the rubrics generated for the metric will be used for evaluation. | 
| output_config | Optional. The output config for the evaluation run. | 
| autorater_config | Optional. The autorater config for the evaluation run. | 
| prompt_template | The prompt template used for inference. The values for variables in the prompt template are defined in EvaluationItem.EvaluationPrompt.PromptTemplateData.values. | 
AutoraterConfig
The autorater config used for the evaluation run.
| Fields | |
|---|---|
| autorater_model | 
 Optional. The fully qualified name of the publisher model or tuned autorater endpoint to use. Publisher model format:  Tuned model endpoint format:  | 
| generation_config | Optional. Configuration options for model generation and outputs. | 
| sample_count | 
 Optional. Number of samples for each instance in the dataset. If not specified, the default is 4. Minimum value is 1, maximum value is 32. | 
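The sample_count bounds can be checked on the client before a request is sent. A minimal sketch, assuming client-side validation is wanted; the function name is hypothetical, and the default and limits are the ones stated for sample_count.

```python
from typing import Optional

DEFAULT_SAMPLE_COUNT = 4   # used when sample_count is not specified
MIN_SAMPLE_COUNT = 1
MAX_SAMPLE_COUNT = 32


def resolve_sample_count(sample_count: Optional[int] = None) -> int:
    """Apply the documented default and reject out-of-range values."""
    if sample_count is None:
        return DEFAULT_SAMPLE_COUNT
    if not MIN_SAMPLE_COUNT <= sample_count <= MAX_SAMPLE_COUNT:
        raise ValueError(
            f"sample_count must be between {MIN_SAMPLE_COUNT} and "
            f"{MAX_SAMPLE_COUNT}, got {sample_count}"
        )
    return sample_count
```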
OutputConfig
The output config for the evaluation run.
| Fields | |
|---|---|
| bigquery_destination | BigQuery destination for evaluation output. | 
| gcs_destination | Cloud Storage destination for evaluation output. | 
PromptTemplate
Prompt template used for inference.
| Fields | |
|---|---|
| Union field source. The source of the prompt template. source can be only one of the following: | |
| prompt_template | 
 Inline prompt template. Template variables should be in the format "{var_name}". Example: "Translate the following from {source_lang} to {target_lang}: {text}" | 
| gcs_uri | 
 Prompt template stored in Cloud Storage. Format: "gs://my-bucket/file-name.txt". | 
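The "{var_name}" placeholder format can be previewed locally. The sketch below uses Python's str.format as a stand-in; the service's own substitution rules (escaping, missing variables) are not described here, so treat this only as an approximation. The template is the example from the field description, and the values dict is hypothetical sample data.

```python
# Inline template taken from the prompt_template field description.
prompt_template = "Translate the following from {source_lang} to {target_lang}: {text}"

# Hypothetical values, in the role of PromptTemplateData.values.
values = {
    "source_lang": "English",
    "target_lang": "French",
    "text": "Good morning",
}


def render(template: str, template_values: dict) -> str:
    """Fill each {var_name} placeholder with the matching value."""
    return template.format(**template_values)


rendered = render(prompt_template, values)
```

With str.format, a placeholder that has no matching key raises KeyError, which surfaces template and data mismatches early.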
InferenceConfig
An inference config used for model inference during the evaluation run.
| Fields | |
|---|---|
| model | 
 Optional. The fully qualified name of the publisher model or endpoint to use. Publisher model format:  Endpoint format:  | 
| agent_config | Optional. Agent config used to generate responses. | 
| Union field model_config. Configuration for the LLM. model_config can be only one of the following: | |
| generation_config | Optional. Generation config. | 
AgentConfig
Configuration that describes an agent.
| Fields | |
|---|---|
| developer_instruction | Optional. The developer instruction for the agent. | 
| tools[] | Optional. The tools available to the agent. | 
State
The state of the evaluation run.
| Enums | |
|---|---|
| STATE_UNSPECIFIED | Unspecified state. | 
| PENDING | The evaluation run is pending. | 
| RUNNING | The evaluation run is running. | 
| SUCCEEDED | The evaluation run has succeeded. | 
| FAILED | The evaluation run has failed. | 
| CANCELLED | The evaluation run has been cancelled. | 
| INFERENCE | The evaluation run is performing inference. | 
| GENERATING_RUBRICS | The evaluation run is performing rubric generation. | 
EvaluationRunMetric
The metric used for evaluation runs.
| Fields | |
|---|---|
| metric | 
 Required. The name of the metric. | 
| metric_config | The metric config. | 
| Union field metric_spec. The metric spec used for evaluation. metric_spec can be only one of the following: | |
| rubric_based_metric_spec | Spec for rubric based metric. | 
| predefined_metric_spec | Spec for a pre-defined metric. | 
| llm_based_metric_spec | Spec for an LLM based metric. | 
LLMBasedMetricSpec
Specification for an LLM based metric.
| Fields | |
|---|---|
| Union field rubrics_source. Source of the rubrics to be used for evaluation. rubrics_source can be only one of the following: | |
| rubric_group_key | 
 Use a pre-defined group of rubrics associated with the input. Refers to a key in the rubric_groups map of EvaluationInstance. | 
| rubric_generation_spec | Dynamically generate rubrics using this specification. | 
| predefined_rubric_generation_spec | Dynamically generate rubrics using a predefined spec. | 
| metric_prompt_template | 
 Required. Template for the prompt sent to the judge model. | 
| system_instruction | 
 Optional. System instructions for the judge model. | 
| judge_autorater_config | Optional. Configuration for the judge LLM (Autorater). | 
| additional_config | Optional. Additional configuration for the metric. | 
PredefinedMetricSpec
Specification for a pre-defined metric.
| Fields | |
|---|---|
| metric_spec_name | 
 Required. The name of a pre-defined metric, such as "instruction_following_v1" or "text_quality_v1". | 
| parameters | Optional. The parameters needed to run the pre-defined metric. | 
RubricBasedMetricSpec
Specification for a metric that is based on rubrics.
| Fields | |
|---|---|
| metric_prompt_template | 
 Optional. Template for the prompt used by the judge model to evaluate against rubrics. | 
| Union field rubrics_source. Source of the rubrics to be used for evaluation. rubrics_source can be only one of the following: | |
| inline_rubrics | Use rubrics provided directly in the spec. | 
| rubric_group_key | 
 Use a pre-defined group of rubrics associated with the input content. This refers to a key in the  | 
| rubric_generation_spec | Dynamically generate rubrics for evaluation using this specification. | 
| judge_autorater_config | Optional. Configuration for the judge LLM (Autorater). The definition of AutoraterConfig needs to be provided. | 
RepeatedRubrics
Defines a list of rubrics, used when providing rubrics inline.
| Fields | |
|---|---|
| rubrics[] | The list of rubrics. | 
RubricGenerationSpec
Specification for how rubrics should be generated.
| Fields | |
|---|---|
| prompt_template | 
 Optional. Template for the prompt used to generate rubrics. The details should be updated based on the most-recent recipe requirements. | 
| rubric_content_type | Optional. The type of rubric content to be generated. | 
| rubric_type_ontology[] | 
 Optional. An optional, pre-defined list of allowed types for generated rubrics. If this field is provided, it implies  | 
| model_config | Optional. Configuration for the model used in rubric generation. Configs including sampling count and base model can be specified here. Flipping is not supported for rubric generation. | 
RubricContentType
Specifies the type of rubric content to generate.
| Enums | |
|---|---|
| RUBRIC_CONTENT_TYPE_UNSPECIFIED | The content type to generate is not specified. | 
| PROPERTY | Generate rubrics based on properties. | 
| NL_QUESTION_ANSWER | Generate rubrics in an NL question answer format. | 
| PYTHON_CODE_ASSERTION | Generate rubrics in a unit test format. | 
EvaluationSet
EvaluationSet is a collection of related EvaluationItems that are evaluated together.
| Fields | |
|---|---|
| name | 
 Identifier. The resource name of the EvaluationSet. Format:  | 
| display_name | 
 Required. The display name of the EvaluationSet. | 
| evaluation_items[] | 
 Required. The EvaluationItems that are part of this dataset. | 
| create_time | Output only. Timestamp when this item was created. | 
| update_time | Output only. Timestamp when this item was last updated. | 
| metadata | Optional. Metadata for the EvaluationSet. | 
EventActions
Actions are parts of events that are executed by the agent.
| Fields | |
|---|---|
| skip_summarization | 
 Optional. If true, the model is not called to summarize the function response. Only used for function_response events. | 
| state_delta | Optional. Indicates that the event is updating the state with the given delta. | 
| artifact_delta | 
 Optional. Indicates that the event is updating an artifact. key is the filename, value is the version. | 
| escalate | 
 Optional. The agent is escalating to a higher level agent. | 
| requested_auth_configs | Optional. Will only be set by a tool response indicating tool request euc. Struct key is the function call id since one function call response (from model) could correspond to multiple function calls. Struct value is the required auth config, which can be another struct. | 
| transfer_agent | 
 Optional. If set, the event transfers to the specified agent. | 
EventMetadata
Metadata relating to an LLM response event.
| Fields | |
|---|---|
| grounding_metadata | Optional. Metadata returned to client when grounding is enabled. | 
| partial | 
 Optional. Indicates whether the text content is part of an unfinished text stream. Only used for streaming mode and when the content is plain text. | 
| turn_complete | 
 Optional. Indicates whether the response from the model is complete. Only used for streaming mode. | 
| interrupted | 
 Optional. Flag indicating that the LLM was interrupted while generating the content, usually because of a user interruption during bidirectional streaming. | 
| long_running_tool_ids[] | 
 Optional. Set of ids of the long running function calls. Agent client will know from this field about which function call is long running. Only valid for function call event. | 
| branch | 
 Optional. The branch of the event. The format is like agent_1.agent_2.agent_3, where agent_1 is the parent of agent_2, and agent_2 is the parent of agent_3. Branch is used when multiple child agents shouldn't see their siblings' conversation history. | 
| custom_metadata | The custom metadata of the LlmResponse. | 
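The branch field encodes an agent hierarchy as a dot-separated path. A minimal sketch of reading that path, assuming agent names themselves contain no dots; the helper names are hypothetical.

```python
from typing import List, Optional


def branch_agents(branch: str) -> List[str]:
    """Split a branch such as 'agent_1.agent_2.agent_3' from root to leaf."""
    return branch.split(".")


def parent_of(branch: str, agent: str) -> Optional[str]:
    """Return the parent of `agent` within the branch, or None for the root."""
    agents = branch_agents(branch)
    index = agents.index(agent)  # raises ValueError if the agent is absent
    return agents[index - 1] if index > 0 else None
```

For the branch agent_1.agent_2.agent_3, parent_of returns agent_2 for agent_3 and None for agent_1, matching the parent relationships described for the field.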
ExactMatchInput
Input for exact match metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for exact match metric. | 
| instances[] | Required. Repeated exact match instances. | 
ExactMatchInstance
Spec for exact match instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
| reference | 
 Required. Ground truth used to compare against the prediction. | 
ExactMatchMetricValue
Exact match metric value for an instance.
| Fields | |
|---|---|
| score | 
 Output only. Exact match score. | 
ExactMatchResults
Results for exact match metric.
| Fields | |
|---|---|
| exact_match_metric_values[] | Output only. Exact match metric values. | 
ExactMatchSpec
This type has no fields.
Spec for exact match metric. Returns 1 if the prediction and reference match exactly, otherwise 0.
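The scoring rule can be illustrated with strict string equality. This is only a sketch: whether the service normalizes case or whitespace before comparing is not stated here, so the version below assumes no normalization. The instances are hypothetical sample data shaped like ExactMatchInstance.

```python
def exact_match(prediction: str, reference: str) -> int:
    """Return 1 when prediction and reference are identical strings, else 0."""
    return 1 if prediction == reference else 0


# Hypothetical instances with the two required fields.
instances = [
    {"prediction": "Paris", "reference": "Paris"},
    {"prediction": "paris", "reference": "Paris"},
]

# One score per instance, in the role of exact_match_metric_values[].score.
scores = [exact_match(i["prediction"], i["reference"]) for i in instances]
```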
ExecutableCode
Code generated by the model that is meant to be executed, and the result returned to the model.
Generated when using the [CodeExecution] tool, in which the code will be automatically executed, and a corresponding [CodeExecutionResult] will also be generated.
| Fields | |
|---|---|
| language | Required. Programming language of the  | 
| code | 
 Required. The code to be executed. | 
Language
Supported programming languages for the generated code.
| Enums | |
|---|---|
| LANGUAGE_UNSPECIFIED | Unspecified language. This value should not be used. | 
| PYTHON | Python >= 3.10, with numpy and simpy available. | 
ExecuteExtensionRequest
Request message for ExtensionExecutionService.ExecuteExtension.
| Fields | |
|---|---|
| name | 
 Required. Name (identifier) of the extension; Format:  | 
| operation_id | 
 Required. The desired ID of the operation to be executed in this extension as defined in  | 
| operation_params | Optional. Request parameters that will be used for executing this operation. The struct should be a map with the parameter name as the key and the parameter value as the value. For example, if this operation requires a parameter "name" to be set to "abc", you can set this to {"name": "abc"}. | 
| runtime_auth_config | Optional. Auth config provided at runtime to override the default value in [Extension.manifest.auth_config][]. The AuthConfig.auth_type should match the value in [Extension.manifest.auth_config][]. | 
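A request body for ExecuteExtension could be assembled as follows. This is a sketch of the field layout only: the resource name is left as a placeholder because its format is not shown here, the operation ID is hypothetical, and no client library call is implied.

```python
import json

request = {
    # Placeholder: substitute the extension's resource name.
    "name": "EXTENSION_RESOURCE_NAME",
    # Hypothetical operation ID defined by the extension's API spec.
    "operation_id": "get_user",
    # Map of parameter name to parameter value, as in the {"name": "abc"} example.
    "operation_params": {"name": "abc"},
}

# Serialized form, for example as an HTTP request body.
body = json.dumps(request)
```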
ExecuteExtensionResponse
Response message for ExtensionExecutionService.ExecuteExtension.
| Fields | |
|---|---|
| content | 
 Response content from the extension. The content should be conformant to the response.content schema in the extension's manifest/OpenAPI spec. | 
Extension
Extensions are tools for large language models to access external data, run computations, etc.
| Fields | |
|---|---|
| name | 
 Identifier. The resource name of the Extension. | 
| display_name | 
 Required. The display name of the Extension. The name can be up to 128 characters long and can consist of any UTF-8 characters. | 
| description | 
 Optional. The description of the Extension. | 
| create_time | Output only. Timestamp when this Extension was created. | 
| update_time | Output only. Timestamp when this Extension was most recently updated. | 
| etag | 
 Optional. Used to perform consistent read-modify-write updates. If not set, a blind "overwrite" update happens. | 
| manifest | Required. Manifest of the Extension. | 
| extension_operations[] | Output only. Supported operations. | 
| runtime_config | Optional. Runtime config controlling the runtime behavior of this Extension. | 
| tool_use_examples[] | Optional. Examples to illustrate the usage of the extension as a tool. | 
| private_service_connect_config | 
 Optional. The PrivateServiceConnect config for the extension. If specified, the service endpoints associated with the Extension should be registered with private network access in the provided Service Directory. If the service contains more than one endpoint with a network, the service will arbitrarily choose one of the endpoints to use for extension execution. | 
| satisfies_pzs | 
 Output only. Reserved for future use. | 
| satisfies_pzi | 
 Output only. Reserved for future use. | 
ExtensionManifest
Manifest spec of an Extension needed for runtime execution.
| Fields | |
|---|---|
| name | 
 Required. Extension name shown to the LLM. The name can be up to 128 characters long. | 
| description | 
 Required. The natural language description shown to the LLM. It should describe the usage of the extension, and is essential for the LLM to perform reasoning. e.g., if the extension is a data store, you can let the LLM know what data it contains. | 
| api_spec | Required. Immutable. The API specification shown to the LLM. | 
| auth_config | Required. Immutable. Type of auth supported by this extension. | 
ApiSpec
The API specification shown to the LLM.
| Fields | |
|---|---|
| Union field  
 | |
| open_api_yaml | 
 The API spec in Open API standard and YAML format. | 
| open_api_gcs_uri | 
 Cloud Storage URI pointing to the OpenAPI spec. | 
ExtensionOperation
Operation of an extension.
| Fields | |
|---|---|
| operation_id | 
 Operation ID that uniquely identifies the operation within the extension. See: "Operation Object" in https://swagger.io/specification/. This field is parsed from the OpenAPI spec. For HTTP extensions, if it does not exist in the spec, one is generated from the HTTP method and path. | 
| function_declaration | Output only. Structured representation of a function declaration as defined by the OpenAPI Spec. | 
ExtensionPrivateServiceConnectConfig
PrivateExtensionConfig configuration for the extension.
| Fields | |
|---|---|
| service_directory | 
 Required. The Service Directory resource name in which the service endpoints associated to the extension are registered. Format:  
 | 
ExternalApi
Retrieve from a data source powered by an external API for grounding. The external API is not owned by Google, but needs to follow the pre-defined API spec.
| Fields | |
|---|---|
| api_spec | The API spec that the external API implements. | 
| endpoint | 
 The endpoint of the external API. The system will call the API at this endpoint to retrieve the data for grounding. Example: https://acme.com:443/search | 
| api_auth | The authentication config to access the API. Deprecated. Please use auth_config instead. | 
| auth_config | The authentication config to access the API. | 
| Union field params. Parameters for the API call. This should be matched with the API spec used. params can be only one of the following: | |
| simple_search_params | Parameters for the simple search API. | 
| elastic_search_params | Parameters for the elastic search API. | 
ApiSpec
The API spec that the external API implements.
| Enums | |
|---|---|
| API_SPEC_UNSPECIFIED | Unspecified API spec. This value should not be used. | 
| SIMPLE_SEARCH | Simple search API spec. | 
| ELASTIC_SEARCH | Elastic search API spec. | 
ElasticSearchParams
The search parameters to use for the ELASTIC_SEARCH spec.
| Fields | |
|---|---|
| index | 
 The ElasticSearch index to use. | 
| search_template | 
 The ElasticSearch search template to use. | 
| num_hits | 
 Optional. Number of hits (chunks) to request. When specified, it is passed to Elasticsearch as the  | 
SimpleSearchParams
This type has no fields.
The search parameters to use for SIMPLE_SEARCH spec.
Fact
The fact used in grounding.
| Fields | |
|---|---|
| query | 
 Query that is used to retrieve this fact. | 
| title | 
 If present, it refers to the title of this fact. | 
| uri | 
 If present, this uri links to the source of the fact. | 
| summary | 
 If present, the summary/snippet of the fact. | 
| vector_distance | 
 If present, the distance between the query vector and this fact vector. | 
| score | 
 If present, according to the underlying Vector DB and the selected metric type, the score can be either the distance or the similarity between the query and the fact and its range depends on the metric type. For example, if the metric type is COSINE_DISTANCE, it represents the distance between the query and the fact. The larger the distance, the less relevant the fact is to the query. The range is [0, 2], while 0 means the most relevant and 2 means the least relevant. | 
| chunk | If present, chunk properties. | 
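The [0, 2] range quoted for COSINE_DISTANCE follows from the usual definition, one minus cosine similarity. The sketch below assumes that definition (the formula itself is not given here) and uses hypothetical two-dimensional vectors.

```python
import math
from typing import Sequence


def cosine_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Return 1 - cosine similarity; 0 is most similar, 2 is least similar."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


same_direction = cosine_distance([1.0, 0.0], [1.0, 0.0])   # most relevant
orthogonal = cosine_distance([1.0, 0.0], [0.0, 1.0])       # unrelated
opposite = cosine_distance([1.0, 0.0], [-1.0, 0.0])        # least relevant
```

The three cases land at the bottom, middle, and top of the range, which is why a larger score means a less relevant fact under this metric type.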
FetchPredictOperationRequest
Request message for PredictionService.FetchPredictOperation.
| Fields | |
|---|---|
| endpoint | 
 Required. The name of the Endpoint requested to serve the prediction. Format:  | 
| operation_name | 
 Required. The server-assigned name for the operation. | 
FileData
URI based data.
| Fields | |
|---|---|
| mime_type | 
 Required. The IANA standard MIME type of the source data. | 
| file_uri | 
 Required. URI. | 
| display_name | 
 Optional. Display name of the file data. Used to provide a label or filename to distinguish between files. This field is only returned in PromptMessage for prompt management. It is currently used in the Gemini GenerateContent calls only when server side tools (code_execution, google_search, and url_context) are enabled. | 
FileStatus
RagFile status.
| Fields | |
|---|---|
| state | Output only. RagFile state. | 
| error_status | 
 Output only. Only when the  | 
State
RagFile state.
| Enums | |
|---|---|
| STATE_UNSPECIFIED | RagFile state is unspecified. | 
| ACTIVE | RagFile resource has been created and indexed successfully. | 
| ERROR | RagFile resource is in a problematic state. See error_message field for details. | 
FluencyInput
Input for fluency metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for fluency score metric. | 
| instance | Required. Fluency instance. | 
FluencyInstance
Spec for fluency instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
FluencyResult
Spec for fluency result.
| Fields | |
|---|---|
| explanation | 
 Output only. Explanation for fluency score. | 
| score | 
 Output only. Fluency score. | 
| confidence | 
 Output only. Confidence for fluency score. | 
FluencySpec
Spec for fluency score metric.
| Fields | |
|---|---|
| version | 
 Optional. Which version to use for evaluation. | 
FulfillmentInput
Input for fulfillment metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for fulfillment score metric. | 
| instance | Required. Fulfillment instance. | 
FulfillmentInstance
Spec for fulfillment instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
| instruction | 
 Required. Inference instruction prompt to compare prediction with. | 
FulfillmentResult
Spec for fulfillment result.
| Fields | |
|---|---|
| explanation | 
 Output only. Explanation for fulfillment score. | 
| score | 
 Output only. Fulfillment score. | 
| confidence | 
 Output only. Confidence for fulfillment score. | 
FulfillmentSpec
Spec for fulfillment metric.
| Fields | |
|---|---|
| version | 
 Optional. Which version to use for evaluation. | 
FullFineTuningSpec
Tuning Spec for Full Fine Tuning.
| Fields | |
|---|---|
| training_dataset_uri | 
 Required. Training dataset used for tuning. The dataset can be specified as either a Cloud Storage path to a JSONL file or as the resource name of a Vertex Multimodal Dataset. | 
| validation_dataset_uri | 
 Optional. Validation dataset used for tuning. The dataset can be specified as either a Cloud Storage path to a JSONL file or as the resource name of a Vertex Multimodal Dataset. | 
| hyper_parameters | Optional. Hyperparameters for Full Fine Tuning. | 
FunctionCall
A predicted [FunctionCall] returned from the model that contains a string representing the [FunctionDeclaration.name] and a structured JSON object containing the parameters and their values.
| Fields | |
|---|---|
| id | 
 Optional. The unique id of the function call. If populated, the client to execute the  | 
| name | 
 Optional. The name of the function to call. Matches [FunctionDeclaration.name]. | 
| args | Optional. The function parameters and values in JSON object format. See [FunctionDeclaration.parameters] for parameter details. | 
FunctionCallingConfig
Function calling config.
| Fields | |
|---|---|
| mode | Optional. Function calling mode. | 
| allowed_function_names[] | 
 Optional. Function names to call. Only set when the Mode is ANY. Function names should match [FunctionDeclaration.name]. With mode set to ANY, model will predict a function call from the set of function names provided. | 
Mode
Function calling mode.
| Enums | |
|---|---|
| MODE_UNSPECIFIED | Unspecified function calling mode. This value should not be used. | 
| AUTO | Default model behavior. The model decides whether to predict function calls or a natural language response. | 
| ANY | Model is constrained to always predict function calls only. If "allowed_function_names" are set, the predicted function calls will be limited to any one of "allowed_function_names", else the predicted function calls will be any one of the provided "function_declarations". | 
| NONE | Model will not predict any function calls. Model behavior is same as when not passing any function declarations. | 
| VALIDATED | Model is constrained to predict either function calls or natural language response. If "allowed_function_names" are set, the predicted function calls will be limited to any one of "allowed_function_names", else the predicted function calls will be any one of the provided "function_declarations". | 
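A client-side consistency check for FunctionCallingConfig might look like the sketch below. It is not service behavior: the function name is hypothetical, and it accepts allowed_function_names under both ANY and VALIDATED because the field description mentions only ANY while the VALIDATED description also refers to the field.

```python
from typing import Dict, List, Set

MODES = {"MODE_UNSPECIFIED", "AUTO", "ANY", "NONE", "VALIDATED"}


def check_function_calling_config(config: Dict, declared_names: Set[str]) -> List[str]:
    """Return a list of problems found; an empty list means the config looks consistent."""
    problems = []
    mode = config.get("mode", "MODE_UNSPECIFIED")
    allowed = config.get("allowed_function_names", [])

    if mode not in MODES:
        problems.append(f"unknown mode: {mode}")
    if allowed and mode not in ("ANY", "VALIDATED"):
        problems.append("allowed_function_names is set but mode is not ANY or VALIDATED")

    # Allowed names should match a FunctionDeclaration.name.
    unknown = [name for name in allowed if name not in declared_names]
    if unknown:
        problems.append(f"names without a matching FunctionDeclaration: {unknown}")
    return problems
```

For example, a config with mode ANY and allowed_function_names ["search_actors"] passes when search_actors is among the declared function names.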
FunctionDeclaration
Structured representation of a function declaration as defined by the OpenAPI 3.0 specification. Included in this declaration are the function name, description, parameters and response type. This FunctionDeclaration is a representation of a block of code that can be used as a Tool by the model and executed by the client.
| Fields | |
|---|---|
| name | 
 Required. The name of the function to call. Must start with a letter or an underscore. Must be a-z, A-Z, 0-9, or contain underscores, dots and dashes, with a maximum length of 64. | 
| description | 
 Optional. Description and purpose of the function. Model uses it to decide how and whether to call the function. | 
| parameters | Optional. Describes the parameters to this function in JSON Schema Object format. Reflects the OpenAPI 3.0.3 Parameter Object. Key (string): the name of the parameter; parameter names are case sensitive. Value (Schema): the Schema defining the type used for the parameter. For a function with no parameters, this can be left unset. Parameter names must start with a letter or an underscore and must contain only the characters a-z, A-Z, 0-9, or underscores, with a maximum length of 64. Example with 1 required and 1 optional parameter: type: OBJECT properties: param1: type: STRING param2: type: INTEGER required: - param1 | 
| parameters_json_schema | Optional. Describes the parameters to the function in JSON Schema format. The schema must describe an object where the properties are the parameters to the function. For example: This field is mutually exclusive with  | 
| response | Optional. Describes the output from this function in JSON Schema format. Reflects the OpenAPI 3.0.3 Response Object. The Schema defines the type used for the response value of the function. | 
| response_json_schema | Optional. Describes the output from this function in JSON Schema format. The value specified by the schema is the response value of the function. This field is mutually exclusive with  | 
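The naming rules and the `parameters` example from the table above can be sketched as follows. The regular expressions and helper names are assumptions made for illustration; they encode the rules as worded in this reference and are not taken from the service implementation.

```python
import re

# Function names: start with a letter or underscore, followed by letters,
# digits, underscores, dots, or dashes; at most 64 characters in total.
FUNCTION_NAME_RE = re.compile(r"[A-Za-z_][A-Za-z0-9_.\-]{0,63}")
# Parameter names: same start rule, but only letters, digits, and underscores.
PARAMETER_NAME_RE = re.compile(r"[A-Za-z_][A-Za-z0-9_]{0,63}")


def is_valid_function_name(name: str) -> bool:
    """Check a name against the documented FunctionDeclaration.name rule."""
    return FUNCTION_NAME_RE.fullmatch(name) is not None


def is_valid_parameter_name(name: str) -> bool:
    """Check a name against the documented parameter-name rule."""
    return PARAMETER_NAME_RE.fullmatch(name) is not None


# The flattened example from the `parameters` row, written as a plain dict:
# one required parameter (param1) and one optional parameter (param2).
example_parameters = {
    "type": "OBJECT",
    "properties": {
        "param1": {"type": "STRING"},
        "param2": {"type": "INTEGER"},
    },
    "required": ["param1"],
}
```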
FunctionResponse
The result output from a [FunctionCall]. It contains a string representing the [FunctionDeclaration.name] and a structured JSON object containing any output from the function, and it is used as context to the model. This should contain the result of a [FunctionCall] made based on model prediction.
| Fields | |
|---|---|
| id | 
 Optional. The id of the function call this response is for. Populated by the client to match the corresponding function call  | 
| name | 
 Required. The name of the function to call. Matches [FunctionDeclaration.name] and [FunctionCall.name]. | 
| response | Required. The function response in JSON object format. Use "output" key to specify function output and "error" key to specify error details (if any). If "output" and "error" keys are not specified, then whole "response" is treated as function output. | 
| parts[] | Optional. Ordered  | 
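The "output" and "error" key convention described for the `response` field can be illustrated with a small client-side sketch. The helper `split_function_response` is hypothetical; it only restates the documented interpretation and is not how the service is known to be implemented.

```python
from typing import Any, Dict, Optional, Tuple


def split_function_response(
    response: Dict[str, Any],
) -> Tuple[Optional[Any], Optional[Any]]:
    """Return (output, error) following the documented key convention."""
    if "output" in response or "error" in response:
        # At least one of the two well-known keys is present.
        return response.get("output"), response.get("error")
    # Neither key is present: the whole response is treated as the output.
    return response, None
```

A response such as `{"temp": 20}` is therefore read as output in its entirety, while `{"error": "boom"}` carries only error details.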
FunctionResponseBlob
Raw media bytes for function response.
Text should not be sent as raw bytes, use the 'text' field.
| Fields | |
|---|---|
| mime_type | 
 Required. The IANA standard MIME type of the source data. | 
| data | 
 Required. Raw bytes. | 
| display_name | 
 Optional. Display name of the blob. Used to provide a label or filename to distinguish blobs. This field is only returned in PromptMessage for prompt management. It is currently used in the Gemini GenerateContent calls only when server side tools (code_execution, google_search, and url_context) are enabled. | 
FunctionResponseFileData
URI based data for function response.
| Fields | |
|---|---|
| mime_type | 
 Required. The IANA standard MIME type of the source data. | 
| file_uri | 
 Required. URI. | 
| display_name | 
 Optional. Display name of the file data. Used to provide a label or filename to distinguish file data entries. This field is only returned in PromptMessage for prompt management. It is currently used in the Gemini GenerateContent calls only when server side tools (code_execution, google_search, and url_context) are enabled. | 
FunctionResponsePart
A datatype containing media that is part of a FunctionResponse message.
A FunctionResponsePart consists of data which has an associated datatype. A FunctionResponsePart can only contain one of the accepted types in FunctionResponsePart.data.
A FunctionResponsePart must have a fixed IANA MIME type identifying the type and subtype of the media if the inline_data field is filled with raw bytes.
| Fields | |
|---|---|
| Union field data. The data of the function response part. data can be only one of the following: | |
| inline_data | Inline media bytes. | 
| file_data | URI based data. | 
GcsDestination
The Google Cloud Storage location where the output is to be written.
| Fields | |
|---|---|
| output_uri_prefix | 
 Required. Google Cloud Storage URI to output directory. If the uri doesn't end with '/', a '/' will be automatically appended. The directory is created if it doesn't exist. | 
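The trailing-slash behavior described for `output_uri_prefix` amounts to the following normalization. This is a minimal sketch; the function name and the bucket path in the usage note are hypothetical.

```python
def normalize_output_uri_prefix(uri: str) -> str:
    """Append a trailing '/' when missing, as described for output_uri_prefix."""
    return uri if uri.endswith("/") else uri + "/"
```

With this sketch, `gs://example-bucket/out` and `gs://example-bucket/out/` both resolve to the same directory prefix.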
GcsSource
The Google Cloud Storage location for the input content.
| Fields | |
|---|---|
| uris[] | 
 Required. Google Cloud Storage URI(-s) to the input file(s). May contain wildcards. For more information on wildcards, see https://cloud.google.com/storage/docs/wildcards. | 
GeminiPreferenceExample
Input example for preference optimization.
| Fields | |
|---|---|
| contents[] | Multi-turn contents that represents the Prompt. | 
| completions[] | List of completions for a given prompt. | 
Completion
Completion and its preference score.
| Fields | |
|---|---|
| completion | Single turn completion for the given prompt. | 
| score | 
 The score for the given completion. | 
GenerateContentRequest
Request message for [PredictionService.GenerateContent].
| Fields | |
|---|---|
| model | 
 Required. The fully qualified name of the publisher model or tuned model endpoint to use. Publisher model format:  Tuned model endpoint format:  | 
| contents[] | Required. The content of the current conversation with the model. For single-turn queries, this is a single instance. For multi-turn queries, this is a repeated field that contains conversation history + latest request. | 
| cached_content | 
 Optional. The name of the cached content used as context to serve the prediction. Note: only used in explicit caching, where users can have control over caching (e.g. what content to cache) and enjoy guaranteed cost savings. Format:  | 
| tools[] | Optional. A list of  A  | 
| tool_config | Optional. Tool config. This config is shared for all tools provided in the request. | 
| labels | 
 Optional. The labels with user-defined metadata for the request. It is used for billing and reporting only. Label keys and values can be no longer than 63 characters (Unicode codepoints) and can only contain lowercase letters, numeric characters, underscores, and dashes. International characters are allowed. Label values are optional. Label keys must start with a letter. | 
| safety_settings[] | Optional. Per request settings for blocking unsafe content. Enforced on GenerateContentResponse.candidates. | 
| model_armor_config | Optional. Settings for prompt and response sanitization using the Model Armor service. If supplied, safety_settings must not be supplied. | 
| generation_config | Optional. Generation config. | 
| system_instruction | Optional. The user provided system instructions for the model. Note: only text should be used in parts and content in each part will be in a separate paragraph. | 
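The constraints on `labels` can be checked before sending a request. The sketch below is an assumption-laden reading of the rules stated in the table (63 code points at most, lowercase or caseless letters, digits, underscores, dashes, keys starting with a letter); `validate_labels` is a hypothetical helper, and the service remains the authority on what it accepts.

```python
from typing import Dict, List

MAX_LABEL_LENGTH = 63  # counted in Unicode code points


def _is_allowed_char(ch: str) -> bool:
    # Digits, underscores, and dashes are allowed as-is.
    if ch in "_-" or ch.isdigit():
        return True
    # Letters must be lowercase; caseless international letters also pass.
    return ch.isalpha() and ch == ch.lower()


def validate_labels(labels: Dict[str, str]) -> List[str]:
    """Return a list of problems found; an empty list means none were found."""
    problems = []
    for key, value in labels.items():
        if not key or not key[0].isalpha():
            problems.append(f"key {key!r} must start with a letter")
        for kind, text in (("key", key), ("value", value)):
            if len(text) > MAX_LABEL_LENGTH:
                problems.append(f"{kind} {text!r} is longer than 63 characters")
            if not all(_is_allowed_char(ch) for ch in text):
                problems.append(f"{kind} {text!r} has a disallowed character")
    return problems
```

Empty label values pass this check, matching the statement that label values are optional.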
GenerateContentResponse
Response message for [PredictionService.GenerateContent].
| Fields | |
|---|---|
| candidates[] | Output only. Generated candidates. | 
| model_version | 
 Output only. The model version used to generate the response. | 
| create_time | Output only. Timestamp when the request is made to the server. | 
| response_id | 
 Output only. response_id is used to identify each response. It is the encoding of the event_id. | 
| prompt_feedback | Output only. Content filter results for a prompt sent in the request. Note: Sent only in the first stream chunk. Only happens when no candidates were generated due to content violations. | 
| usage_metadata | Usage metadata about the response(s). | 
PromptFeedback
Content filter results for a prompt sent in the request. Note: This is sent only in the first stream chunk and only if no candidates were generated due to content violations.
| Fields | |
|---|---|
| block_reason | Output only. The reason why the prompt was blocked. | 
| safety_ratings[] | Output only. A list of safety ratings for the prompt. There is one rating per category. | 
| block_reason_message | 
 Output only. A readable message that explains the reason why the prompt was blocked. | 
BlockedReason
The reason why the prompt was blocked.
| Enums | |
|---|---|
| BLOCKED_REASON_UNSPECIFIED | The blocked reason is unspecified. | 
| SAFETY | The prompt was blocked for safety reasons. | 
| OTHER | The prompt was blocked for other reasons. For example, it may be due to the prompt's language, or because it contains other harmful content. | 
| BLOCKLIST | The prompt was blocked because it contains a term from the terminology blocklist. | 
| PROHIBITED_CONTENT | The prompt was blocked because it contains prohibited content. | 
| MODEL_ARMOR | The prompt was blocked by Model Armor. | 
| IMAGE_SAFETY | The prompt was blocked because it contains content that is unsafe for image generation. | 
| JAILBREAK | The prompt was blocked as a jailbreak attempt. | 
UsageMetadata
Usage metadata about the content generation request and response. This message provides a detailed breakdown of token usage and other relevant metrics.
| Fields | |
|---|---|
| prompt_token_count | 
 The total number of tokens in the prompt. This includes any text, images, or other media provided in the request. When  | 
| candidates_token_count | 
 The total number of tokens in the generated candidates. | 
| total_token_count | 
 The total number of tokens for the entire request. This is the sum of  | 
| tool_use_prompt_token_count | 
 Output only. The number of tokens in the results from tool executions, which are provided back to the model as input, if applicable. | 
| thoughts_token_count | 
 Output only. The number of tokens that were part of the model's generated "thoughts" output, if applicable. | 
| cached_content_token_count | 
 Output only. The number of tokens in the cached content that was used for this request. | 
| prompt_tokens_details[] | Output only. A detailed breakdown of the token count for each modality in the prompt. | 
| cache_tokens_details[] | Output only. A detailed breakdown of the token count for each modality in the cached content. | 
| candidates_tokens_details[] | Output only. A detailed breakdown of the token count for each modality in the generated candidates. | 
| tool_use_prompt_tokens_details[] | Output only. A detailed breakdown by modality of the token counts from the results of tool executions, which are provided back to the model as input. | 
| traffic_type | Output only. The traffic type for this request. | 
TrafficType
The type of traffic that this request was processed with, indicating which quota is consumed.
| Enums | |
|---|---|
| TRAFFIC_TYPE_UNSPECIFIED | Unspecified request traffic type. | 
| ON_DEMAND | The request was processed using Pay-As-You-Go quota. | 
| PROVISIONED_THROUGHPUT | Type for Provisioned Throughput traffic. | 
GenerateInstanceRubricsRequest
Request message for EvaluationService.GenerateInstanceRubrics.
| Fields | |
|---|---|
| location | 
 Required. The resource name of the Location to generate rubrics from. Format:  | 
| contents[] | Required. The prompt to generate rubrics from. For single-turn queries, this is a single instance. For multi-turn queries, this is a repeated field that contains conversation history + latest request. | 
| predefined_rubric_generation_spec | Optional. Specification for using the rubric generation configs of a pre-defined metric, e.g. "generic_quality_v1" and "instruction_following_v1". Some of the configs may be only used in rubric generation and not supporting evaluation, e.g. "fully_customized_generic_quality_v1". If this field is set, the  | 
| rubric_generation_spec | Optional. Specification for how the rubrics should be generated. | 
| agent_config | Optional. Agent configuration, required for agent-based rubric generation. | 
GenerateInstanceRubricsResponse
Response message for EvaluationService.GenerateInstanceRubrics.
| Fields | |
|---|---|
| generated_rubrics[] | Output only. A list of generated rubrics. | 
GenerateMemoriesOperationMetadata
Details of MemoryBankService.GenerateMemories operation.
| Fields | |
|---|---|
| generic_metadata | The common part of the operation metadata. | 
GenerateMemoriesRequest
Request message for MemoryBankService.GenerateMemories. Maximum size is 8 MB.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the ReasoningEngine to generate memories for. Format:  | 
| disable_consolidation | 
 Optional. If true, generated memories will not be consolidated with existing memories; all generated memories will be added as new memories regardless of whether they are duplicates of or contradictory to existing memories. By default, memory consolidation is enabled. | 
| scope | 
 Optional. The scope of the memories that should be generated. Memories will be consolidated across memories with the same scope. Must be provided unless the scope is defined in the source content. If  | 
| revision_labels | 
 Optional. Labels to be applied to the generated memory revisions. For example, you can use this to label a revision with its data source. | 
| Union field source. Source content used to generate memories. source can be only one of the following: | |
| vertex_session_source | Defines a Vertex Session as the source content from which to generate memories. | 
| direct_contents_source | Defines a direct source of content as the source content from which to generate memories. | 
| direct_memories_source | Defines a direct source of memories that should be uploaded to Memory Bank. This is similar to  | 
| Union field revision_expiration. The expiration of the Memory Revisions created as a result of this request. If not set, Memory Bank will defer to MemoryBankConfig.memory_revision_default_ttl or the global default, 365 days. revision_expiration can be only one of the following: | |
| revision_expire_time | Optional. Timestamp of when the revision is considered expired. If not set, the memory revision will be kept until manually deleted. | 
| revision_ttl | Optional. The TTL for the revision. The expiration time is computed: now + TTL. | 
| disable_memory_revisions | 
 Optional. If true, no revisions will be created for this request. | 
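The `revision_expiration` union can be read as a small resolution rule: an explicit expire time wins, a TTL is added to the current time, and otherwise a default TTL applies. The sketch below follows the union description (config default, then the 365-day global default); all function and parameter names are hypothetical, and the actual server-side precedence may differ.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

GLOBAL_DEFAULT_TTL = timedelta(days=365)  # global default stated in the table


def resolve_revision_expiration(
    now: datetime,
    revision_expire_time: Optional[datetime] = None,
    revision_ttl: Optional[timedelta] = None,
    config_default_ttl: Optional[timedelta] = None,
) -> datetime:
    """Compute when a memory revision would expire under the stated rules."""
    if revision_expire_time is not None and revision_ttl is not None:
        # A union field can hold only one of its members.
        raise ValueError("set only one of revision_expire_time, revision_ttl")
    if revision_expire_time is not None:
        return revision_expire_time
    if revision_ttl is not None:
        # The expiration time is computed as now + TTL.
        return now + revision_ttl
    if config_default_ttl is not None:
        return now + config_default_ttl
    return now + GLOBAL_DEFAULT_TTL
```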
DirectContentsSource
Defines a direct source of content from which to generate the memories.
| Fields | |
|---|---|
| events[] | Required. The source content (i.e. chat history) to generate memories from. | 
Event
A single piece of conversation from which to generate memories.
| Fields | |
|---|---|
| content | Required. A single piece of content from which to generate memories. | 
DirectMemoriesSource
Defines a direct source of memories that should be uploaded to Memory Bank with consolidation.
| Fields | |
|---|---|
| direct_memories[] | Required. The direct memories to upload to Memory Bank. At most 5 direct memories are allowed per request. | 
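Because at most 5 direct memories are allowed per request, a larger list has to be split across several requests. The following is a minimal batching sketch; `batch_direct_memories` is a hypothetical helper, and each memory is represented as a plain dict with a `fact` key purely for illustration.

```python
from typing import Dict, Iterator, List

MAX_DIRECT_MEMORIES_PER_REQUEST = 5  # limit stated in the table above


def batch_direct_memories(
    memories: List[Dict[str, object]],
) -> Iterator[List[Dict[str, object]]]:
    """Yield consecutive batches that respect the per-request limit."""
    for start in range(0, len(memories), MAX_DIRECT_MEMORIES_PER_REQUEST):
        yield memories[start : start + MAX_DIRECT_MEMORIES_PER_REQUEST]
```

Twelve memories, for instance, would be sent as three requests holding 5, 5, and 2 entries.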
DirectMemory
A direct memory to upload to Memory Bank.
| Fields | |
|---|---|
| fact | 
 Required. The fact to consolidate with existing memories. | 
| topics[] | Optional. The topics that the consolidated memories should be associated with. | 
VertexSessionSource
Defines an Agent Engine Session from which to generate the memories. If scope is not provided, the scope will be extracted from the Session (i.e. {"user_id": session.user_id}).
| Fields | |
|---|---|
| session | 
 Required. The resource name of the Session to generate memories for. Format:  | 
| start_time | Optional. Time range to define which session events should be used to generate memories. Start time (inclusive) of the time range. If not set, the start time is unbounded. | 
| end_time | Optional. End time (exclusive) of the time range. If not set, the end time is unbounded. | 
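The time range is half-open: `start_time` is inclusive, `end_time` is exclusive, and an unset bound is unbounded. A sketch of that selection rule, using hypothetical `(timestamp, payload)` tuples to stand in for session events:

```python
from datetime import datetime
from typing import Iterable, List, Optional, Tuple

Event = Tuple[datetime, str]  # (event timestamp, payload), for illustration


def select_events(
    events: Iterable[Event],
    start_time: Optional[datetime] = None,
    end_time: Optional[datetime] = None,
) -> List[Event]:
    """Keep events with start_time <= timestamp < end_time."""
    selected = []
    for timestamp, payload in events:
        if start_time is not None and timestamp < start_time:
            continue  # before the inclusive start bound
        if end_time is not None and timestamp >= end_time:
            continue  # at or after the exclusive end bound
        selected.append((timestamp, payload))
    return selected
```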
GenerateMemoriesResponse
Response message for MemoryBankService.GenerateMemories.
| Fields | |
|---|---|
| generated_memories[] | The generated memories. | 
GeneratedMemory
A memory generated by the operation.
| Fields | |
|---|---|
| memory | The generated Memory. | 
| action | The action that was performed on the Memory. | 
| previous_revision | 
 The previous revision of the Memory before the action was performed. This field is only set if the action is  | 
Action
Actions that can be performed on a Memory.
| Enums | |
|---|---|
| ACTION_UNSPECIFIED | Action is unspecified. | 
| CREATED | The memory was created. | 
| UPDATED | The memory was updated. The fact field may not be updated if the existing fact is still accurate. | 
| DELETED | The memory was deleted. | 
GenerateSyntheticDataRequest
Request message for DataFoundryService.GenerateSyntheticData.
| Fields | |
|---|---|
| location | 
 Required. The resource name of the Location to run the job. Format:  | 
| count | 
 Required. The number of synthetic examples to generate. For this stateless API, the count is limited to a small number. | 
| output_field_specs[] | Required. The schema of the desired output, defined by a list of fields. | 
| examples[] | Optional. A list of few-shot examples to guide the model's output style and format. | 
| Union field strategy. The generation strategy to use. strategy can be only one of the following: | |
| task_description | Generate data from a high-level task description. | 
GenerateSyntheticDataResponse
The response containing the generated data.
| Fields | |
|---|---|
| synthetic_examples[] | A list of generated synthetic examples. | 
GenerationConfig
Generation config.
| Fields | |
|---|---|
| stop_sequences[] | 
 Optional. Stop sequences. | 
| response_mime_type | 
 Optional. Output response mimetype of the generated candidate text. Supported mimetype: -  | 
| response_modalities[] | Optional. The modalities of the response. | 
| thinking_config | Optional. Config for thinking features. An error will be returned if this field is set for models that don't support thinking. | 
| model_config | Optional. Config for model selection. | 
| temperature | 
 Optional. Controls the randomness of predictions. | 
| top_p | 
 Optional. If specified, nucleus sampling will be used. | 
| top_k | 
 Optional. If specified, top-k sampling will be used. | 
| candidate_count | 
 Optional. Number of candidates to generate. | 
| max_output_tokens | 
 Optional. The maximum number of output tokens to generate per message. | 
| response_logprobs | 
 Optional. If true, export the logprobs results in response. | 
| logprobs | 
 Optional. Logit probabilities. | 
| presence_penalty | 
 Optional. Positive penalties. | 
| frequency_penalty | 
 Optional. Frequency penalties. | 
| seed | 
 Optional. Seed. | 
| response_schema | Optional. The  | 
| response_json_schema | Optional. Output schema of the generated response. This is an alternative to  If set,  While the full JSON Schema may be sent, not all features are supported. Specifically, only the following properties are supported: 
 The non-standard  Cyclic references are unrolled to a limited degree and, as such, may only be used within non-required properties. (Nullable properties are not sufficient.) If  | 
| routing_config | Optional. Routing configuration. | 
| audio_timestamp | 
 Optional. If enabled, audio timestamp will be included in the request to the model. | 
| media_resolution | Optional. If specified, the media resolution specified will be used. | 
| speech_config | Optional. The speech generation config. | 
| enable_affective_dialog | 
 Optional. If enabled, the model will detect emotions and adapt its responses accordingly. | 
| image_config | Optional. Config for image generation features. | 
MediaResolution
Media resolution for the input media.
| Enums | |
|---|---|
| MEDIA_RESOLUTION_UNSPECIFIED | Media resolution has not been set. | 
| MEDIA_RESOLUTION_LOW | Media resolution set to low (64 tokens). | 
| MEDIA_RESOLUTION_MEDIUM | Media resolution set to medium (256 tokens). | 
| MEDIA_RESOLUTION_HIGH | Media resolution set to high (zoomed reframing with 256 tokens). | 
Modality
The modalities of the response.
| Enums | |
|---|---|
| MODALITY_UNSPECIFIED | Unspecified modality. Will be processed as text. | 
| TEXT | Text modality. | 
| IMAGE | Image modality. | 
| AUDIO | Audio modality. | 
ModelConfig
Config for model selection.
| Fields | |
|---|---|
| feature_selection_preference | Required. Feature selection preference. | 
FeatureSelectionPreference
Options for feature selection preference.
| Enums | |
|---|---|
| FEATURE_SELECTION_PREFERENCE_UNSPECIFIED | Unspecified feature selection preference. | 
| PRIORITIZE_QUALITY | Prefer higher quality over lower cost. | 
| BALANCED | Balanced feature selection preference. | 
| PRIORITIZE_COST | Prefer lower cost over higher quality. | 
RoutingConfig
The configuration for routing the request to a specific model.
| Fields | |
|---|---|
| Union field routing_config. Routing mode. routing_config can be only one of the following: | |
| auto_mode | Automated routing. | 
| manual_mode | Manual routing. | 
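Since `routing_config` is a union, a request should carry `auto_mode` or `manual_mode`, never both. The check below is a sketch over a plain dict representation; `routing_mode` is a hypothetical helper and the nested values in the usage note are placeholders.

```python
from typing import Any, Dict, Optional

ROUTING_FIELDS = ("auto_mode", "manual_mode")  # members of the union


def routing_mode(routing_config: Dict[str, Any]) -> Optional[str]:
    """Return the name of the routing field that is set, or None if unset."""
    present = [name for name in ROUTING_FIELDS if name in routing_config]
    if len(present) > 1:
        # A union field can hold only one of its members.
        raise ValueError(f"only one of {ROUTING_FIELDS} may be set")
    return present[0] if present else None
```

For example, `routing_mode({"manual_mode": {"model_name": "example-model"}})` returns `"manual_mode"`.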
AutoRoutingMode
When automated routing is specified, the routing will be determined by the pretrained routing model and customer provided model routing preference.
| Fields | |
|---|---|
| model_routing_preference | The model routing preference. | 
ModelRoutingPreference
The model routing preference.
| Enums | |
|---|---|
| UNKNOWN | Unspecified model routing preference. | 
| PRIORITIZE_QUALITY | Prefer higher quality over low cost. | 
| BALANCED | Balanced model routing preference. | 
| PRIORITIZE_COST | Prefer lower cost over higher quality. | 
ManualRoutingMode
When manual routing is set, the specified model will be used directly.
| Fields | |
|---|---|
| model_name | 
 The model name to use. Only the public LLM models are accepted. See Supported models. | 
ThinkingConfig
Config for thinking features.
| Fields | |
|---|---|
| include_thoughts | 
 Optional. Indicates whether to include thoughts in the response. If true, thoughts are returned only when available. | 
| thinking_budget | 
 Optional. Indicates the thinking budget in tokens. | 
GenericOperationMetadata
Generic Metadata shared by all operations.
| Fields | |
|---|---|
| partial_failures[] | Output only. Partial failures encountered. E.g. single files that couldn't be read. This field should never exceed 20 entries. Status details field will contain standard Google Cloud error details. | 
| create_time | Output only. Time when the operation was created. | 
| update_time | Output only. Time when the operation was updated for the last time. If the operation has finished (successfully or not), this is the finish time. | 
GetCacheConfigRequest
Request message for getting a cache config.
| Fields | |
|---|---|
| name | 
 Required. Name of the cache config. Format:  -  | 
GetCachedContentRequest
Request message for GenAiCacheService.GetCachedContent.
| Fields | |
|---|---|
| name | 
 Required. The resource name referring to the cached content. | 
GetEvaluationItemRequest
Request message for EvaluationManagementService.GetEvaluationItem.
| Fields | |
|---|---|
| name | 
 Required. The name of the EvaluationItem resource. Format:  | 
GetEvaluationRunRequest
Request message for EvaluationManagementService.GetEvaluationRun.
| Fields | |
|---|---|
| name | 
 Required. The name of the EvaluationRun resource. Format:  | 
GetEvaluationSetRequest
Request message for EvaluationManagementService.GetEvaluationSet.
| Fields | |
|---|---|
| name | 
 Required. The name of the EvaluationSet resource. Format:  | 
GetExtensionRequest
Request message for ExtensionRegistryService.GetExtension.
| Fields | |
|---|---|
| name | 
 Required. The name of the Extension resource. Format:  | 
GetMemoryRequest
Request message for MemoryBankService.GetMemory.
| Fields | |
|---|---|
| name | 
 Required. The resource name of the Memory. Format:  | 
GetMemoryRevisionRequest
Request message for MemoryBankService.GetMemoryRevision.
| Fields | |
|---|---|
| name | 
 Required. The resource name of the Memory Revision to retrieve. Format:  | 
GetRagCorpusRequest
Request message for VertexRagDataService.GetRagCorpus.
| Fields | |
|---|---|
| name | 
 Required. The name of the RagCorpus resource. Format:  | 
GetRagEngineConfigRequest
Request message for VertexRagDataService.GetRagEngineConfig.
| Fields | |
|---|---|
| name | 
 Required. The name of the RagEngineConfig resource. Format:  | 
GetRagFileRequest
Request message for VertexRagDataService.GetRagFile.
| Fields | |
|---|---|
| name | 
 Required. The name of the RagFile resource. Format:  | 
GetReasoningEngineRequest
Request message for ReasoningEngineService.GetReasoningEngine.
| Fields | |
|---|---|
| name | 
 Required. The name of the ReasoningEngine resource. Format:  | 
GetSessionRequest
Request message for SessionService.GetSession.
| Fields | |
|---|---|
| name | 
 Required. The resource name of the session. Format:  | 
GetTuningJobRequest
Request message for GenAiTuningService.GetTuningJob.
| Fields | |
|---|---|
| name | 
 Required. The name of the TuningJob resource. Format:  | 
GoAway
The server will soon be unable to service the client.
| Fields | |
|---|---|
| time_left | The remaining time before the connection will be terminated as ABORTED. The minimal time returned here is specified differently together with the rate limits for a given model. | 
GoogleDriveSource
The Google Drive location for the input content.
| Fields | |
|---|---|
| resource_ids[] | Required. Google Drive resource IDs. | 
ResourceId
The type and ID of the Google Drive resource.
| Fields | |
|---|---|
| resource_type | Required. The type of the Google Drive resource. | 
| resource_id | 
 Required. The ID of the Google Drive resource. | 
ResourceType
The type of the Google Drive resource.
| Enums | |
|---|---|
| RESOURCE_TYPE_UNSPECIFIED | Unspecified resource type. | 
| RESOURCE_TYPE_FILE | File resource type. | 
| RESOURCE_TYPE_FOLDER | Folder resource type. | 
GoogleMaps
Tool to retrieve public maps data for grounding, powered by Google.
| Fields | |
|---|---|
| enable_widget | 
 Optional. If true, include the widget context token in the response. | 
GoogleSearchRetrieval
Tool to retrieve public web data for grounding, powered by Google.
| Fields | |
|---|---|
| dynamic_retrieval_config | Specifies the dynamic retrieval configuration for the given source. | 
GroundednessInput
Input for groundedness metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for groundedness metric. | 
| instance | Required. Groundedness instance. | 
GroundednessInstance
Spec for groundedness instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
| context | 
 Required. Background information provided in context used to compare against the prediction. | 
GroundednessResult
Spec for groundedness result.
| Fields | |
|---|---|
| explanation | 
 Output only. Explanation for groundedness score. | 
| score | 
 Output only. Groundedness score. | 
| confidence | 
 Output only. Confidence for groundedness score. | 
GroundednessSpec
Spec for groundedness metric.
| Fields | |
|---|---|
| version | 
 Optional. Which version to use for evaluation. | 
GroundingChunk
Grounding chunk.
| Fields | |
|---|---|
| Union field chunk_type. Chunk type. chunk_type can be only one of the following: | |
| web | Grounding chunk from the web. | 
| retrieved_context | Grounding chunk from context retrieved by the retrieval tools. | 
| maps | Grounding chunk from Google Maps. | 
Maps
Chunk from Google Maps.
| Fields | |
|---|---|
| place_answer_sources | Sources used to generate the place answer. This includes review snippets and photos that were used to generate the answer, as well as uris to flag content. | 
| uri | 
 URI reference of the place. | 
| title | 
 Title of the place. | 
| text | 
 Text of the place answer. | 
| place_id | 
 This Place's resource name, in  | 
PlaceAnswerSources
Sources used to generate the place answer.
| Fields | |
|---|---|
| review_snippets[] | Snippets of reviews that are used to generate the answer. | 
ReviewSnippet
Encapsulates a review snippet.
| Fields | |
|---|---|
| review_id | 
 Id of the review referencing the place. | 
| google_maps_uri | 
 A link to show the review on Google Maps. | 
| title | 
 Title of the review. | 
RetrievedContext
Chunk from context retrieved by the retrieval tools.
| Fields | |
|---|---|
| Union field context_details. Tool-specific details about the retrieved context. context_details can be only one of the following: | |
| rag_chunk | Additional context for the RAG retrieval result. This is only populated when using the RAG retrieval tool. | 
| uri | 
 URI reference of the attribution. | 
| title | 
 Title of the attribution. | 
| text | 
 Text of the attribution. | 
| document_name | 
 Output only. The full document name for the referenced Vertex AI Search document. | 
Web
Chunk from the web.
| Fields | |
|---|---|
| uri | 
 URI reference of the chunk. | 
| title | 
 Title of the chunk. | 
| domain | 
 Domain of the (original) URI. | 
GroundingMetadata
Metadata returned to client when grounding is enabled.
| Fields | |
|---|---|
| web_search_queries[] | 
 Optional. Web search queries for the follow-up web search. | 
| retrieval_queries[] | 
 Optional. Queries executed by the retrieval tools. | 
| grounding_chunks[] | List of supporting references retrieved from specified grounding source. | 
| grounding_supports[] | Optional. List of grounding support. | 
| source_flagging_uris[] | Optional. Output only. List of source flagging uris. This is currently populated only for Google Maps grounding. | 
| search_entry_point | Optional. Google search entry for the follow-up web searches. | 
| retrieval_metadata | Optional. Output only. Retrieval metadata. | 
| google_maps_widget_context_token | 
 Optional. Output only. Resource name of the Google Maps widget context token to be used with the PlacesContextElement widget to render contextual data. This is populated only for Google Maps grounding. | 
SourceFlaggingUri
Source content flagging uri for a place or review. This is currently populated only for Google Maps grounding.
| Fields | |
|---|---|
| source_id | 
 Id of the place or review. | 
| flag_content_uri | 
 A link where users can flag a problem with the source (place or review). | 
GroundingSupport
Grounding support.
| Fields | |
|---|---|
| grounding_chunk_indices[] | 
 A list of indices (into 'grounding_chunk') specifying the citations associated with the claim. For instance [1,3,4] means that grounding_chunk[1], grounding_chunk[3], grounding_chunk[4] are the retrieved content attributed to the claim. | 
| confidence_scores[] | 
 Confidence score of the support references. Ranges from 0 to 1. 1 is the most confident. For Gemini 2.0 and before, this list must have the same size as the grounding_chunk_indices. For Gemini 2.5 and after, this list will be empty and should be ignored. | 
| segment | Segment of the content this support belongs to. | 
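The relationship between `grounding_chunk_indices`, `confidence_scores`, and the chunk list can be sketched as below. The helper `cited_chunks` is hypothetical and chunks are represented as plain strings for brevity; an empty score list (as described for Gemini 2.5 and after) yields `None` scores.

```python
from typing import List, Optional, Sequence, Tuple


def cited_chunks(
    grounding_chunks: Sequence[str],
    grounding_chunk_indices: Sequence[int],
    confidence_scores: Sequence[float] = (),
) -> List[Tuple[str, Optional[float]]]:
    """Pair each cited chunk with its confidence score, when one is given."""
    if confidence_scores and len(confidence_scores) != len(grounding_chunk_indices):
        # When populated, the score list has the same size as the index list.
        raise ValueError("confidence_scores must match grounding_chunk_indices")
    result = []
    for position, index in enumerate(grounding_chunk_indices):
        score = confidence_scores[position] if confidence_scores else None
        result.append((grounding_chunks[index], score))
    return result
```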
HarmCategory
Harm categories that will block the content.
| Enums | |
|---|---|
| HARM_CATEGORY_UNSPECIFIED | The harm category is unspecified. | 
| HARM_CATEGORY_HATE_SPEECH | The harm category is hate speech. | 
| HARM_CATEGORY_DANGEROUS_CONTENT | The harm category is dangerous content. | 
| HARM_CATEGORY_HARASSMENT | The harm category is harassment. | 
| HARM_CATEGORY_SEXUALLY_EXPLICIT | The harm category is sexually explicit content. | 
| HARM_CATEGORY_CIVIC_INTEGRITY | Deprecated: Election filter is no longer supported. The harm category is civic integrity. | 
| HARM_CATEGORY_IMAGE_HATE | The harm category is image hate. | 
| HARM_CATEGORY_IMAGE_DANGEROUS_CONTENT | The harm category is image dangerous content. | 
| HARM_CATEGORY_IMAGE_HARASSMENT | The harm category is image harassment. | 
| HARM_CATEGORY_IMAGE_SEXUALLY_EXPLICIT | The harm category is image sexually explicit content. | 
| HARM_CATEGORY_JAILBREAK | The harm category is for jailbreak prompts. | 
HttpElementLocation
Enum of the locations in which an HTTP element can appear.
| Enums | |
|---|---|
| HTTP_IN_UNSPECIFIED | |
| HTTP_IN_QUERY | Element is in the HTTP request query. | 
| HTTP_IN_HEADER | Element is in the HTTP request header. | 
| HTTP_IN_PATH | Element is in the HTTP request path. | 
| HTTP_IN_BODY | Element is in the HTTP request body. | 
| HTTP_IN_COOKIE | Element is in the HTTP request cookie. | 
ImageConfig
Config for image generation features.
| Fields | |
|---|---|
| image_output_options | Optional. The image output format for generated images. | 
| aspect_ratio | 
 Optional. The desired aspect ratio for the generated images. The following aspect ratios are supported: "1:1", "2:3", "3:2", "3:4", "4:3", "4:5", "5:4", "9:16", "16:9", "21:9" | 
| person_generation | Optional. Controls whether the model can generate people. | 
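A client might check an aspect_ratio value against the supported list before sending a request. The sketch below is only an illustration: the constant and the `aspect_ratio_value` helper are hypothetical names, and the service performs its own validation.

```python
# Aspect ratios listed as supported for the aspect_ratio field.
SUPPORTED_ASPECT_RATIOS = (
    "1:1", "2:3", "3:2", "3:4", "4:3", "4:5", "5:4", "9:16", "16:9", "21:9",
)


def aspect_ratio_value(aspect_ratio: str) -> float:
    """Return width / height for a supported ratio, or raise ValueError."""
    if aspect_ratio not in SUPPORTED_ASPECT_RATIOS:
        raise ValueError(f"unsupported aspect ratio: {aspect_ratio!r}")
    width, height = aspect_ratio.split(":")
    return int(width) / int(height)
```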
ImageOutputOptions
The image output format for generated images.
| Fields | |
|---|---|
| mime_type | 
 Optional. The image format that the output should be saved as. | 
| compression_quality | 
 Optional. The compression quality of the output image. | 
PersonGeneration
Enum that controls the generation of people.
| Enums | |
|---|---|
| PERSON_GENERATION_UNSPECIFIED | Generation of images of people is unspecified. | 
| ALLOW_ALL | Generate images that include adults and children. | 
| ALLOW_ADULT | Generate images of adults, but not children. | 
| ALLOW_NONE | Block generation of images of people. | 
ImportExtensionOperationMetadata
Details of ExtensionRegistryService.ImportExtension operation.
| Fields | |
|---|---|
| generic_metadata | The common part of the operation metadata. | 
ImportExtensionRequest
Request message for ExtensionRegistryService.ImportExtension.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location to import the Extension in. Format:  | 
| extension | Required. The Extension to import. | 
ImportRagFilesConfig
Config for importing RagFiles.
| Fields | |
|---|---|
| rag_file_chunking_config | Specifies the size and overlap of chunks after importing RagFiles. | 
| rag_file_transformation_config | Specifies the transformation config for RagFiles. | 
| rag_file_parsing_config | Optional. Specifies the parsing config for RagFiles. RAG will use the default parser if this field is not set. | 
| rag_file_metadata_config | Specifies the metadata config for RagFiles. Including paths for metadata schema and metadata. | 
| max_embedding_requests_per_min | 
 Optional. The max number of queries per minute that this job is allowed to make to the embedding model specified on the corpus. This value is specific to this job and not shared across other import jobs. Consult the Quotas page on the project to set an appropriate value here. If unspecified, a default value of 1,000 QPM is used. | 
| global_max_embedding_requests_per_min | 
 Optional. The max number of queries per minute that the indexing pipeline job is allowed to make to the embedding model specified in the project. Follow the quota usage guidelines of the embedding model you use to set this value properly. If this value is not specified, the indexing pipeline job uses max_embedding_requests_per_min as the global limit. | 
| rebuild_ann_index | 
 Rebuilds the ANN index to optimize for recall on the imported data. Only applicable for RagCorpora running on RagManagedDb with  Default is false, i.e., index is not rebuilt. | 
| Union field import_source. The source of the import. import_source can be only one of the following: | |
| gcs_source | Google Cloud Storage location. Supports importing individual files as well as entire Google Cloud Storage directories. Sample formats: -  | 
| google_drive_source | Google Drive location. Supports importing individual files as well as Google Drive folders. | 
| slack_source | Slack channels with their corresponding access tokens. | 
| jira_source | Jira queries with their corresponding authentication. | 
| share_point_sources | SharePoint sources. | 
| Union field partial_failure_sink. Optional. If provided, all partial failures are written to the sink. Deprecated. Prefer to use the import_result_sink. partial_failure_sink can be only one of the following: | |
| partial_failure_gcs_sink | The Cloud Storage path to write partial failures to. Deprecated. Prefer to use  | 
| partial_failure_bigquery_sink | The BigQuery destination to write partial failures to. It should be a bigquery table resource name (e.g. "bq://projectId.bqDatasetId.bqTableId"). The dataset must exist. If the table does not exist, it will be created with the expected schema. If the table exists, the schema will be validated and data will be added to this existing table. Deprecated. Prefer to use  | 
| Union field import_result_sink. Optional. If provided, all successfully imported files and all partial failures are written to the sink. import_result_sink can be only one of the following: | |
| import_result_gcs_sink | The Cloud Storage path to write import result to. | 
| import_result_bigquery_sink | The BigQuery destination to write import result to. It should be a bigquery table resource name (e.g. "bq://projectId.bqDatasetId.bqTableId"). The dataset must exist. If the table does not exist, it will be created with the expected schema. If the table exists, the schema will be validated and data will be added to this existing table. | 
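The BigQuery table resource name format shown above ("bq://projectId.bqDatasetId.bqTableId") can be split into its three parts with a small helper. This is a sketch only: `parse_bq_uri` is a hypothetical name, and the helper assumes that none of the three IDs contains a dot, which may not hold for every project ID.

```python
def parse_bq_uri(uri: str) -> tuple[str, str, str]:
    """Split 'bq://project.dataset.table' into (project, dataset, table)."""
    prefix = "bq://"
    if not uri.startswith(prefix):
        raise ValueError(f"expected a URI starting with {prefix!r}: {uri!r}")
    parts = uri[len(prefix):].split(".")
    # Assumption: exactly three non-empty, dot-free components.
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected bq://project.dataset.table, got {uri!r}")
    project, dataset, table = parts
    return project, dataset, table
```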
ImportRagFilesOperationMetadata
Runtime operation information for VertexRagDataService.ImportRagFiles.
| Fields | |
|---|---|
| generic_metadata | The operation generic information. | 
| rag_corpus_id | 
 The resource ID of RagCorpus that this operation is executed on. | 
| import_rag_files_config | Output only. The config that was passed in the ImportRagFilesRequest. | 
| progress_percentage | 
 The progress percentage of the operation. Value is in the range [0, 100]. This percentage is calculated as follows: progress_percentage = 100 * (successes + failures + skips) / total | 
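The progress formula above can be written out directly. In this sketch the function name and the handling of a zero total are assumptions; the source does not say what the service reports when there is nothing to import.

```python
def progress_percentage(successes: int, failures: int, skips: int, total: int) -> float:
    """Compute 100 * (successes + failures + skips) / total."""
    if total <= 0:
        # Assumption: report 0 when no files are scheduled, to avoid dividing by zero.
        return 0.0
    return 100 * (successes + failures + skips) / total
```

Note that failed and skipped files count toward progress, so an operation can reach 100 even when no file was imported successfully.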
ImportRagFilesRequest
Request message for VertexRagDataService.ImportRagFiles.
| Fields | |
|---|---|
| parent | 
 Required. The name of the RagCorpus resource into which to import files. Format:  | 
| import_rag_files_config | Required. The config for the RagFiles to be synced and imported into the RagCorpus.  | 
ImportRagFilesResponse
Response message for VertexRagDataService.ImportRagFiles.
| Fields | |
|---|---|
| imported_rag_files_count | 
 The number of RagFiles that were imported into the RagCorpus. | 
| failed_rag_files_count | 
 The number of RagFiles that failed while importing into the RagCorpus. | 
| skipped_rag_files_count | 
 The number of RagFiles that were skipped while importing into the RagCorpus. | 
| Union field partial_failure_sink. The location into which the partial failures were written. partial_failure_sink can be only one of the following: | |
| partial_failures_gcs_path | 
 The Google Cloud Storage path into which the partial failures were written. | 
| partial_failures_bigquery_table | 
 The BigQuery table into which the partial failures were written. | 
IntermediateExtractedMemory
An extracted memory that is the intermediate result before consolidation.
| Fields | |
|---|---|
| fact | 
 Output only. The fact of the extracted memory. | 
InvokeRequest
Request message for PredictionService.Invoke.
| Fields | |
|---|---|
| endpoint | 
 Required. The name of the Endpoint requested to serve the prediction. Format:  | 
| deployed_model_id | 
 ID of the DeployedModel that serves the invoke request. | 
| http_body | The invoke method input. Supports HTTP headers and arbitrary data payload. | 
JiraSource
The Jira source for the ImportRagFilesRequest.
| Fields | |
|---|---|
| jira_queries[] | Required. The Jira queries. | 
JiraQueries
JiraQueries contains the Jira queries and corresponding authentication.
| Fields | |
|---|---|
| projects[] | 
 A list of Jira projects to import in their entirety. | 
| custom_queries[] | 
 A list of custom Jira queries to import. For information about JQL (Jira Query Language), see https://support.atlassian.com/jira-service-management-cloud/docs/use-advanced-search-with-jira-query-language-jql/ | 
| email | 
 Required. The Jira email address. | 
| server_uri | 
 Required. The Jira server URI. | 
| api_key_config | Required. The SecretManager secret version resource name (e.g. projects/{project}/secrets/{secret}/versions/{version}) storing the Jira API key. See Manage API tokens for your Atlassian account. | 
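The SecretManager secret version resource name format quoted for api_key_config can be checked on the client side before building a request. The pattern and the `parse_secret_version` helper below are illustrative assumptions, not part of the API; they only check the overall shape of the name.

```python
import re

# Shape of projects/{project}/secrets/{secret}/versions/{version}.
SECRET_VERSION_PATTERN = re.compile(
    r"^projects/(?P<project>[^/]+)/secrets/(?P<secret>[^/]+)/versions/(?P<version>[^/]+)$"
)


def parse_secret_version(name: str) -> dict[str, str]:
    """Return the project, secret, and version parts of a secret version name."""
    match = SECRET_VERSION_PATTERN.match(name)
    if match is None:
        raise ValueError(f"not a secret version resource name: {name!r}")
    return match.groupdict()
```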
JobState
Describes the state of a job.
| Enums | |
|---|---|
| JOB_STATE_UNSPECIFIED | The job state is unspecified. | 
| JOB_STATE_QUEUED | The job has been just created or resumed and processing has not yet begun. | 
| JOB_STATE_PENDING | The service is preparing to run the job. | 
| JOB_STATE_RUNNING | The job is in progress. | 
| JOB_STATE_SUCCEEDED | The job completed successfully. | 
| JOB_STATE_FAILED | The job failed. | 
| JOB_STATE_CANCELLING | The job is being cancelled. From this state the job may only go to either JOB_STATE_SUCCEEDED, JOB_STATE_FAILED or JOB_STATE_CANCELLED. | 
| JOB_STATE_CANCELLED | The job has been cancelled. | 
| JOB_STATE_PAUSED | The job has been stopped, and can be resumed. | 
| JOB_STATE_EXPIRED | The job has expired. | 
| JOB_STATE_UPDATING | The job is being updated. Only jobs in the RUNNING state can be updated. After updating, the job goes back to the RUNNING state. | 
| JOB_STATE_PARTIALLY_SUCCEEDED | The job partially succeeded; some results may be missing due to errors. | 
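The table states only a few transition rules explicitly: what can follow JOB_STATE_CANCELLING, what follows JOB_STATE_UPDATING, and which state allows updates. The sketch below encodes just those documented rules. The helper names are hypothetical, and transitions from other states are reported as unknown rather than guessed.

```python
from typing import Optional

# Only the transitions spelled out in the enum descriptions.
DOCUMENTED_NEXT_STATES = {
    "JOB_STATE_CANCELLING": {
        "JOB_STATE_SUCCEEDED",
        "JOB_STATE_FAILED",
        "JOB_STATE_CANCELLED",
    },
    "JOB_STATE_UPDATING": {"JOB_STATE_RUNNING"},
}


def can_update(state: str) -> bool:
    """Only jobs in the RUNNING state can be updated."""
    return state == "JOB_STATE_RUNNING"


def is_documented_transition(current: str, following: str) -> Optional[bool]:
    """Return True/False for documented rules, None when the table is silent."""
    allowed = DOCUMENTED_NEXT_STATES.get(current)
    if allowed is None:
        return None
    return following in allowed
```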
LLMBasedMetricSpec
Specification for an LLM based metric.
| Fields | |
|---|---|
| Union field rubrics_source. Source of the rubrics to be used for evaluation. rubrics_source can be only one of the following: | |
| rubric_group_key | 
 Use a pre-defined group of rubrics associated with the input. Refers to a key in the rubric_groups map of EvaluationInstance. | 
| rubric_generation_spec | Dynamically generate rubrics using this specification. | 
| predefined_rubric_generation_spec | Dynamically generate rubrics using a predefined spec. | 
| metric_prompt_template | 
 Required. Template for the prompt sent to the judge model. | 
| system_instruction | 
 Optional. System instructions for the judge model. | 
| judge_autorater_config | Optional. Configuration for the judge LLM (Autorater). | 
| additional_config | Optional. Additional configuration for the metric. | 
ListCachedContentsRequest
Request to list CachedContents.
| Fields | |
|---|---|
| parent | 
 Required. The parent, which owns this collection of cached contents. | 
| page_size | 
 Optional. The maximum number of cached contents to return. The service may return fewer than this value. If unspecified, some default (under maximum) number of items will be returned. The maximum value is 1000; values above 1000 will be coerced to 1000. | 
| page_token | 
 Optional. A page token, received from a previous  When paginating, all other parameters provided to  | 
ListCachedContentsResponse
Response with a list of CachedContents.
| Fields | |
|---|---|
| cached_contents[] | List of cached contents. | 
| next_page_token | 
 A token, which can be sent as  | 
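The page_size, page_token, and next_page_token fields suggest a pagination loop along the following lines. This is a sketch under stated assumptions: `fetch_page` is a hypothetical stand-in for the actual list call, requests and responses are modelled as plain dicts, and an empty next_page_token is assumed to mean there are no further pages.

```python
from typing import Callable, Iterator

MAX_PAGE_SIZE = 1000  # values above 1000 are coerced to 1000 by the service


def list_all_cached_contents(
    fetch_page: Callable[[dict], dict], parent: str, page_size: int = 100
) -> Iterator[dict]:
    """Yield every cached content by following next_page_token."""
    # Clamp locally so the request reflects what the service will apply.
    request = {"parent": parent, "page_size": min(page_size, MAX_PAGE_SIZE)}
    while True:
        response = fetch_page(request)
        yield from response.get("cached_contents", [])
        token = response.get("next_page_token", "")
        if not token:  # assumption: empty token marks the last page
            return
        # Keep all other parameters unchanged between pages.
        request = {**request, "page_token": token}
```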
ListEvaluationItemsRequest
Request message for EvaluationManagementService.ListEvaluationItems.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location from which to list the Evaluation Items. Format:  | 
| page_size | 
 Optional. The maximum number of Evaluation Items to return. | 
| page_token | 
 Optional. A page token, received from a previous  | 
| filter | 
 Optional. Filter expression that matches a subset of the EvaluationItems to show. For field names both snake_case and camelCase are supported. For more information about filter syntax, see AIP-160. | 
| order_by | 
 Optional. A comma-separated list of fields to order by, sorted in ascending order by default. Use  | 
ListEvaluationItemsResponse
Response message for EvaluationManagementService.ListEvaluationItems.
| Fields | |
|---|---|
| evaluation_items[] | List of EvaluationItems in the requested page. | 
| next_page_token | 
 A token to retrieve the next page of results. | 
ListEvaluationRunsRequest
Request message for EvaluationManagementService.ListEvaluationRuns.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location from which to list the Evaluation Runs. Format:  | 
| page_size | 
 Optional. The maximum number of Evaluation Runs to return. | 
| page_token | 
 Optional. A page token, received from a previous  | 
| filter | 
 Optional. Filter expression that matches a subset of the EvaluationRuns to show. For field names both snake_case and camelCase are supported. For more information about filter syntax, see AIP-160. | 
| order_by | 
 Optional. A comma-separated list of fields to order by, sorted in ascending order by default. Use  | 
ListEvaluationRunsResponse
Response message for EvaluationManagementService.ListEvaluationRuns.
| Fields | |
|---|---|
| evaluation_runs[] | List of EvaluationRuns in the requested page. | 
| next_page_token | 
 A token to retrieve the next page of results. | 
ListEvaluationSetsRequest
Request message for EvaluationManagementService.ListEvaluationSets.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location from which to list the Evaluation Sets. Format:  | 
| page_size | 
 Optional. The maximum number of Evaluation Sets to return. | 
| page_token | 
 Optional. A page token, received from a previous  | 
| filter | 
 Optional. Filter expression that matches a subset of the EvaluationSets to show. For field names both snake_case and camelCase are supported. For more information about filter syntax, see AIP-160. | 
| order_by | 
 Optional. A comma-separated list of fields to order by, sorted in ascending order by default. Use  | 
ListEvaluationSetsResponse
Response message for EvaluationManagementService.ListEvaluationSets.
| Fields | |
|---|---|
| evaluation_sets[] | List of EvaluationSets in the requested page. | 
| next_page_token | 
 A token to retrieve the next page of results. | 
ListEventsRequest
Request message for SessionService.ListEvents.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the session to list events from. Format:  | 
| page_size | 
 Optional. The maximum number of events to return. The service may return fewer than this value. If unspecified, at most 100 events will be returned. These events are ordered by timestamp in ascending order. | 
| page_token | 
 Optional. The  | 
| filter | 
 Optional. The standard list filter. Supported fields:  *  More detail in AIP-160. | 
ListEventsResponse
Response message for SessionService.ListEvents.
| Fields | |
|---|---|
| session_events[] | A list of events matching the request. Ordered by timestamp in ascending order. | 
| next_page_token | 
 A token, which can be sent as  | 
ListExtensionsRequest
Request message for ExtensionRegistryService.ListExtensions.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location to list the Extensions from. Format:  | 
| filter | 
 Optional. The standard list filter. Supported fields:  *  More detail in AIP-160. | 
| page_size | 
 Optional. The standard list page size. | 
| page_token | 
 Optional. The standard list page token. | 
| order_by | 
 Optional. A comma-separated list of fields to order by, sorted in ascending order. Use "desc" after a field name for descending. Supported fields:  *  Example:  | 
ListExtensionsResponse
Response message for ExtensionRegistryService.ListExtensions.
| Fields | |
|---|---|
| extensions[] | List of Extensions in the requested page. | 
| next_page_token | 
 A token to retrieve the next page of results. Pass to  | 
ListMemoriesRequest
Request message for MemoryBankService.ListMemories.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the ReasoningEngine to list the Memories under. Format:  | 
| filter | 
 Optional. The standard list filter. More detail in AIP-160. Supported fields (equality match only):  *  | 
| page_size | 
 Optional. The standard list page size. | 
| page_token | 
 Optional. The standard list page token. | 
| order_by | 
 Optional. The standard list order by string. If not specified, the default order is  More detail in AIP-132. Supported fields:  *  | 
ListMemoriesResponse
Response message for MemoryBankService.ListMemories.
| Fields | |
|---|---|
| memories[] | List of Memories in the requested page. | 
| next_page_token | 
 A token to retrieve the next page of results. Pass to  | 
ListMemoryRevisionsRequest
Request message for MemoryBankService.ListMemoryRevisions.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Memory to list revisions for. Format:  | 
| page_size | 
 Optional. The standard list page size. | 
| page_token | 
 Optional. The standard list page token. | 
| filter | 
 Optional. The standard list filter. More detail in AIP-160. Supported fields (equality match only):  *  | 
ListMemoryRevisionsResponse
Response message for MemoryBankService.ListMemoryRevisions.
| Fields | |
|---|---|
| memory_revisions[] | The list of Memory Revisions in the requested page. | 
| next_page_token | 
 A token, which can be sent as  | 
ListRagCorporaRequest
Request message for VertexRagDataService.ListRagCorpora.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location from which to list the RagCorpora. Format:  | 
| page_size | 
 Optional. The standard list page size. | 
| page_token | 
 Optional. The standard list page token. Typically obtained via  | 
ListRagCorporaResponse
Response message for VertexRagDataService.ListRagCorpora.
| Fields | |
|---|---|
| rag_corpora[] | List of RagCorpora in the requested page. | 
| next_page_token | 
 A token to retrieve the next page of results. Pass to  | 
ListRagFilesRequest
Request message for VertexRagDataService.ListRagFiles.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the RagCorpus from which to list the RagFiles. Format:  | 
| page_size | 
 Optional. The standard list page size. | 
| page_token | 
 Optional. The standard list page token. Typically obtained via  | 
ListRagFilesResponse
Response message for VertexRagDataService.ListRagFiles.
| Fields | |
|---|---|
| rag_files[] | List of RagFiles in the requested page. | 
| next_page_token | 
 A token to retrieve the next page of results. Pass to  | 
ListReasoningEnginesRequest
Request message for ReasoningEngineService.ListReasoningEngines.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location to list the ReasoningEngines from. Format:  | 
| filter | 
 Optional. The standard list filter. More detail in AIP-160. | 
| page_size | 
 Optional. The standard list page size. | 
| page_token | 
 Optional. The standard list page token. | 
ListReasoningEnginesResponse
Response message for ReasoningEngineService.ListReasoningEngines.
| Fields | |
|---|---|
| reasoning_engines[] | List of ReasoningEngines in the requested page. | 
| next_page_token | 
 A token to retrieve the next page of results. Pass to  | 
ListSessionsRequest
Request message for SessionService.ListSessions.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the location to list sessions from. Format:  | 
| page_size | 
 Optional. The maximum number of sessions to return. The service may return fewer than this value. If unspecified, at most 100 sessions will be returned. | 
| page_token | 
 Optional. The  | 
| filter | 
 Optional. The standard list filter. Supported fields:  *  Example:  | 
| order_by | 
 Optional. A comma-separated list of fields to order by, sorted in ascending order. Use "desc" after a field name for descending. Supported fields:  *  Example:  | 
ListSessionsResponse
Response message for SessionService.ListSessions.
| Fields | |
|---|---|
| sessions[] | A list of sessions matching the request. | 
| next_page_token | 
 A token, which can be sent as  | 
ListTuningJobsRequest
Request message for GenAiTuningService.ListTuningJobs.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location to list the TuningJobs from. Format:  | 
| filter | 
 Optional. The standard list filter. | 
| page_size | 
 Optional. The standard list page size. | 
| page_token | 
 Optional. The standard list page token. Typically obtained via  | 
ListTuningJobsResponse
Response message for GenAiTuningService.ListTuningJobs.
| Fields | |
|---|---|
| tuning_jobs[] | List of TuningJobs in the requested page. | 
| next_page_token | 
 A token to retrieve the next page of results. Pass to  | 
LogprobsResult
Logprobs Result
| Fields | |
|---|---|
| top_candidates[] | Length = total number of decoding steps. | 
| chosen_candidates[] | Length = total number of decoding steps. The chosen candidates may or may not be in top_candidates. | 
Candidate
Candidate for the logprobs token and score.
| Fields | |
|---|---|
| token | 
 The candidate's token string value. | 
| token_id | 
 The candidate's token id value. | 
| log_probability | 
 The candidate's log probability. | 
TopCandidates
Candidates with top log probabilities at each decoding step.
| Fields | |
|---|---|
| candidates[] | Sorted by log probability in descending order. | 
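Because top_candidates and chosen_candidates both have one entry per decoding step, they can be walked in parallel. The sketch below assumes the result has been converted to plain dicts using the field names from the tables above; both helper names are hypothetical.

```python
def sequence_log_probability(chosen_candidates: list[dict]) -> float:
    """Sum the log probabilities of the chosen token at every decoding step."""
    return sum(candidate["log_probability"] for candidate in chosen_candidates)


def chosen_in_top(chosen_candidates: list[dict], top_candidates: list[dict]) -> list[bool]:
    """For each step, report whether the chosen token is among the top candidates."""
    if len(chosen_candidates) != len(top_candidates):
        raise ValueError("both lists must have one entry per decoding step")
    flags = []
    for chosen, top in zip(chosen_candidates, top_candidates):
        top_tokens = {candidate["token"] for candidate in top["candidates"]}
        # The chosen candidate may or may not appear in top_candidates.
        flags.append(chosen["token"] in top_tokens)
    return flags
```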
Memory
A memory.
| Fields | |
|---|---|
| name | 
 Identifier. The resource name of the Memory. Format:  | 
| display_name | 
 Optional. Display name of the Memory. | 
| description | 
 Optional. Description of the Memory. | 
| create_time | Output only. Timestamp when this Memory was created. | 
| update_time | Output only. Timestamp when this Memory was most recently updated. | 
| fact | 
 Required. Semantic knowledge extracted from the source content. | 
| scope | 
 Required. Immutable. The scope of the Memory. Memories are isolated within their scope. The scope is defined when creating or generating memories. Scope values cannot contain the wildcard character '*'. | 
| topics[] | Optional. The Topics of the Memory. | 
| Union field expiration. The expiration of the Memory. If not set, the Memory will not be automatically deleted. expiration can be only one of the following: | |
| expire_time | Optional. Timestamp of when this resource is considered expired. This is always provided on output when  | 
| ttl | Optional. Input only. The TTL for this resource. The expiration time is computed: now + TTL. | 
| Union field revision_expiration. (Input only) The expiration of the Memory Revision created as a result of this request. If not set, Memory Bank will defer to MemoryBankConfig.memory_revision_default_ttl or the global default, 365 days. revision_expiration can be only one of the following: | |
| revision_expire_time | Optional. Input only. Timestamp of when the revision is considered expired. If not set, the memory revision will be kept until manually deleted. | 
| revision_ttl | Optional. Input only. The TTL for the revision. The expiration time is computed: now + TTL. | 
| disable_memory_revisions | 
 Optional. Input only. If true, no revision will be created for this request. | 
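The ttl rule (expiration time = now + TTL) and the scope wildcard restriction can be illustrated with a short sketch. It assumes the scope is a mapping of string keys to string values, and both helper names are hypothetical; the service computes the expiration itself, so this only shows what to expect.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional


def expire_time_from_ttl(ttl: timedelta, now: Optional[datetime] = None) -> datetime:
    """Mirror the documented rule: expiration time = now + TTL."""
    if now is None:
        now = datetime.now(timezone.utc)
    return now + ttl


def validate_scope(scope: dict[str, str]) -> None:
    """Reject scope values that contain the wildcard character '*'."""
    for key, value in scope.items():
        if "*" in value:
            raise ValueError(f"scope value for {key!r} must not contain '*'")
```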
MemoryBankCustomizationConfig
Configuration for organizing memories for a particular scope.
| Fields | |
|---|---|
| scope_keys[] | 
 Optional. The scope keys (e.g. 'user_id') for which to use this config. A request's scope must include all of the provided keys for the config to be used (order does not matter). If empty, then the config will be used for all requests that do not have a more specific config. Only one default config is allowed per Memory Bank. | 
| memory_topics[] | Optional. Topics of information that should be extracted from conversations and stored as memories. If not set, then Memory Bank's default topics will be used. | 
| generate_memories_examples[] | Optional. Examples of how to generate memories for a particular scope. | 
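The scope_keys matching rule can be sketched as a subset test. The source does not define how a "more specific" config is chosen among several matches; the sketch below assumes the config with the most scope keys wins, and `select_config` and the dict shapes are hypothetical.

```python
from typing import Optional


def select_config(request_scope: dict, configs: list[dict]) -> Optional[dict]:
    """Pick the customization config that applies to a request's scope."""
    request_keys = set(request_scope)
    # A config applies when the request's scope includes all of its keys.
    # An empty scope_keys list matches every request (the default config).
    matching = [
        config for config in configs
        if set(config.get("scope_keys", [])) <= request_keys
    ]
    if not matching:
        return None
    # Assumption: "more specific" means "has more scope keys".
    return max(matching, key=lambda config: len(config.get("scope_keys", [])))
```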
GenerateMemoriesExample
An example of how to generate memories for a particular scope.
| Fields | |
|---|---|
| generated_memories[] | Optional. The memories that are expected to be generated from the input conversation. An empty list indicates that no memories are expected to be generated for the input conversation. | 
| Union field source. The input source for the example. source can be only one of the following: | |
| conversation_source | A conversation source for the example. | 
ConversationSource
A conversation source for the example. This is similar to DirectContentsSource.
| Fields | |
|---|---|
| events[] | Optional. The input conversation events for the example. | 
Event
A single conversation event.
| Fields | |
|---|---|
| content | Required. The content of the event. | 
GeneratedMemory
A memory generated by the operation.
| Fields | |
|---|---|
| fact | 
 Required. The fact to generate a memory from. | 
| topics[] | Optional. The list of topics that the memory should be associated with. For example, use  | 
MemoryTopic
A topic of information that should be extracted from conversations and stored as memories.
| Fields | |
|---|---|
| Union field topic_type. The type of the topic. topic_type can be only one of the following: | |
| custom_memory_topic | A custom memory topic defined by the developer. | 
| managed_memory_topic | A managed memory topic defined by Memory Bank. | 
CustomMemoryTopic
A custom memory topic defined by the developer.
| Fields | |
|---|---|
| label | 
 Required. The label of the topic. | 
| description | 
 Required. Description of the memory topic. This should explain what information should be extracted for this topic. | 
ManagedMemoryTopic
A managed memory topic defined by the system.
| Fields | |
|---|---|
| managed_topic_enum | Required. The managed topic. | 
ManagedTopicEnum
Managed topics.
| Enums | |
|---|---|
| MANAGED_TOPIC_ENUM_UNSPECIFIED | Unspecified topic. This value should not be used. | 
| USER_PERSONAL_INFO | Significant personal information about the User like first names, relationships, hobbies, important dates. | 
| USER_PREFERENCES | Stated or implied likes, dislikes, preferred styles, or patterns. | 
| KEY_CONVERSATION_DETAILS | Important milestones or conclusions within the dialogue. | 
| EXPLICIT_INSTRUCTIONS | Information that the user explicitly requested to remember or forget. | 
MemoryRevision
A revision of a Memory.
| Fields | |
|---|---|
| name | 
 Identifier. The resource name of the Memory Revision. Format:  | 
| create_time | Output only. Timestamp when this Memory Revision was created. | 
| expire_time | Output only. Timestamp of when this resource is considered expired. | 
| fact | 
 Output only. The fact of the Memory Revision. This corresponds to the  | 
| labels | 
 Output only. The labels of the Memory Revision. These labels are applied to the MemoryRevision when it is created based on  | 
| extracted_memories[] | Output only. The extracted memories from the source content before consolidation when the memory was updated via GenerateMemories. This information was used to modify an existing Memory via Consolidation. | 
MemoryTopicId
A memory topic identifier. This will be used to label a Memory and to restrict which topics are eligible for generation or retrieval.
| Fields | |
|---|---|
| Union field topic_id. Topic identifier. topic_id can be only one of the following: | |
| custom_memory_topic_label | 
 Optional. The custom memory topic label. | 
| managed_memory_topic | Optional. The managed memory topic. | 
Metric
The metric used for running evaluations.
| Fields | |
|---|---|
| aggregation_metrics[] | Optional. The aggregation metrics to use. | 
| Union field metric_spec. The spec for the metric. It is either a pre-defined metric or an inline metric spec. metric_spec can be only one of the following: | |
| predefined_metric_spec | The spec for a pre-defined metric. | 
| llm_based_metric_spec | Spec for an LLM based metric. | 
| pointwise_metric_spec | Spec for pointwise metric. | 
| pairwise_metric_spec | Spec for pairwise metric. | 
| exact_match_spec | Spec for exact match metric. | 
| bleu_spec | Spec for bleu metric. | 
| rouge_spec | Spec for rouge metric. | 
AggregationMetric
The aggregation metrics supported by EvaluationService.EvaluateDataset.
| Enums | |
|---|---|
| AGGREGATION_METRIC_UNSPECIFIED | Unspecified aggregation metric. | 
| AVERAGE | Average aggregation metric. Not supported for pairwise metric. | 
| MODE | Mode aggregation metric. | 
| STANDARD_DEVIATION | Standard deviation aggregation metric. Not supported for pairwise metric. | 
| VARIANCE | Variance aggregation metric. Not supported for pairwise metric. | 
| MINIMUM | Minimum aggregation metric. Not supported for pairwise metric. | 
| MAXIMUM | Maximum aggregation metric. Not supported for pairwise metric. | 
| MEDIAN | Median aggregation metric. Not supported for pairwise metric. | 
| PERCENTILE_P90 | 90th percentile aggregation metric. Not supported for pairwise metric. | 
| PERCENTILE_P95 | 95th percentile aggregation metric. Not supported for pairwise metric. | 
| PERCENTILE_P99 | 99th percentile aggregation metric. Not supported for pairwise metric. | 
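For intuition, a few of these aggregations can be reproduced locally over a list of per-instance scores with Python's statistics module. This is only a sketch: the `aggregate` helper is hypothetical, and the variance, standard deviation, and percentile metrics are left out because the source does not state which definitions (for example population versus sample) the service uses.

```python
import statistics
from typing import Callable, Sequence

# Local stand-ins for a subset of the AggregationMetric values.
_AGGREGATORS: dict[str, Callable[[Sequence[float]], float]] = {
    "AVERAGE": statistics.fmean,
    "MODE": statistics.mode,
    "MINIMUM": min,
    "MAXIMUM": max,
    "MEDIAN": statistics.median,
}


def aggregate(scores: Sequence[float], metrics: Sequence[str]) -> dict[str, float]:
    """Compute the requested aggregations over per-instance scores."""
    if not scores:
        raise ValueError("at least one score is required")
    results = {}
    for metric in metrics:
        if metric not in _AGGREGATORS:
            raise ValueError(f"not covered by this sketch: {metric}")
        results[metric] = _AGGREGATORS[metric](scores)
    return results
```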
MetricResult
Result for a single metric on a single instance.
| Fields | |
|---|---|
| rubric_verdicts[] | Output only. For rubric-based metrics, the verdicts for each rubric. | 
| score | 
 Output only. The score for the metric. Please refer to each metric's documentation for the meaning of the score. | 
| explanation | 
 Output only. The explanation for the metric result. | 
| error | Output only. The error status for the metric result. | 
MetricxInput
Input for MetricX metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for MetricX metric. | 
| instance | Required. MetricX instance. | 
MetricxInstance
Spec for MetricX instance. The fields used for evaluation depend on the MetricX version.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
| reference | 
 Optional. Ground truth used to compare against the prediction. | 
| source | 
 Optional. Source text in original language. | 
MetricxResult
Spec for MetricX result. Calculates the MetricX score for the given instance using the version specified in the spec.
| Fields | |
|---|---|
| score | 
 Output only. MetricX score. Range depends on version. | 
MetricxSpec
Spec for MetricX metric.
| Fields | |
|---|---|
| source_language | 
 Optional. Source language in BCP-47 format. | 
| target_language | 
 Optional. Target language in BCP-47 format. Covers both prediction and reference. | 
| version | Required. Which version to use for evaluation. | 
MetricxVersion
MetricX Version options.
| Enums | |
|---|---|
| METRICX_VERSION_UNSPECIFIED | MetricX version unspecified. | 
| METRICX_24_REF | MetricX 2024 (2.6) for translation + reference (reference-based). | 
| METRICX_24_SRC | MetricX 2024 (2.6) for translation + source (QE). | 
| METRICX_24_SRC_REF | MetricX 2024 (2.6) for translation + source + reference (source-reference-combined). | 
Modality
Content Part modality.
| Enums | |
|---|---|
| MODALITY_UNSPECIFIED | Unspecified modality. | 
| TEXT | Plain text. | 
| IMAGE | Image. | 
| VIDEO | Video. | 
| AUDIO | Audio. | 
| DOCUMENT | Document, e.g. PDF. | 
ModalityTokenCount
Represents token counting info for a single modality.
| Fields | |
|---|---|
| modality | The modality associated with this token count. | 
| token_count | 
 Number of tokens. | 
ModelArmorConfig
Configuration for Model Armor integration for prompts and responses.
| Fields | |
|---|---|
| prompt_template_name | 
 Optional. The name of the Model Armor template to use for prompt sanitization. | 
| response_template_name | 
 Optional. The name of the Model Armor template to use for response sanitization. | 
MultiSpeakerVoiceConfig
Configuration for a multi-speaker text-to-speech setup. Enables the use of up to two distinct voices in a single synthesis request.
| Fields | |
|---|---|
| speaker_voice_configs[] | Required. A list of configurations for the voices of the speakers. Exactly two speaker voice configurations must be provided. | 
OptimizePromptRequest
Request message for GenAiTuningService.OptimizePrompt.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location to optimize the prompt in. Format:  | 
| content | Required. The content to optimize. | 
| optimization_target | Optional. The target model to optimize the prompt for. | 
OptimizationTarget
The target model to optimize the prompt for.
| Enums | |
|---|---|
| OPTIMIZATION_TARGET_UNSPECIFIED | Unspecified optimization target. Defaults to OPTIMIZATION_TARGET_GENERAL. | 
| OPTIMIZATION_TARGET_GENERAL | Optimize the prompt for quality. | 
| OPTIMIZATION_TARGET_GEMINI_NANO | Optimize the prompt for Gemini Nano. | 
OptimizePromptResponse
Response message for GenAiTuningService.OptimizePrompt.
| Fields | |
|---|---|
| content | Output only. The optimized prompt content. | 
OutputConfig
Config for evaluation output.
| Fields | |
|---|---|
| Union field destination. The destination for evaluation output. destination can be only one of the following: | |
| gcs_destination | Cloud storage destination for evaluation output. | 
OutputFieldSpec
Defines a specification for a single output field.
| Fields | |
|---|---|
| field_name | 
 Required. The name of the output field. | 
| guidance | 
 Optional, but recommended. Additional guidance specific to this field, giving the LLM targeted instructions for generating the content of a single output field. While the LLM can sometimes infer content from the field name, providing explicit guidance is preferred. | 
| field_type | Optional. The data type of the field. Defaults to CONTENT if not set. | 
FieldType
The data type of the field.
| Enums | |
|---|---|
| FIELD_TYPE_UNSPECIFIED | Field type is unspecified. | 
| CONTENT | Arbitrary content field type. | 
| TEXT | Text field type. | 
| IMAGE | Image field type. | 
| AUDIO | Audio field type. | 
OutputInfo
Describes the info for output of EvaluationService.EvaluateDataset.
| Fields | |
|---|---|
| Union field output_location. The output location into which evaluation output is written. output_location can be only one of the following: | |
| gcs_output_directory | 
 Output only. The full path of the Cloud Storage directory created, into which the evaluation results and aggregation results are written. | 
PairwiseChoice
Pairwise prediction autorater preference.
| Enums | |
|---|---|
| PAIRWISE_CHOICE_UNSPECIFIED | Unspecified prediction choice. | 
| BASELINE | Baseline prediction wins. | 
| CANDIDATE | Candidate prediction wins. | 
| TIE | Winner cannot be determined. | 
PairwiseMetricInput
Input for pairwise metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for pairwise metric. | 
| instance | Required. Pairwise metric instance. | 
PairwiseMetricInstance
Pairwise metric instance. Usually one instance corresponds to one row in an evaluation dataset.
| Fields | |
|---|---|
| Union field instance. Instance for pairwise metric. instance can be only one of the following: | |
| json_instance | 
 Instance specified as a JSON string. String key-value pairs are expected in the json_instance to render PairwiseMetricSpec.instance_prompt_template. | 
| content_map_instance | Key-value contents for the multimodal input, including text, image, video, audio, PDF, etc. The key is the placeholder in the metric prompt template, and the value is the multimodal content. | 
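Because json_instance is a JSON string rather than a nested object, the key-value pairs have to be serialized before they are placed in the instance. A minimal sketch, assuming the instance is represented as a plain dictionary: the helper `make_json_instance` and the placeholder names (`prompt`, `baseline_response`, `response`) are illustrative and are not mandated by the API.

```python
import json


def make_json_instance(fields: dict) -> str:
    """Serialize string key-value pairs into a JSON string for json_instance."""
    for key, value in fields.items():
        # The field description expects string keys and string values.
        if not isinstance(key, str) or not isinstance(value, str):
            raise TypeError(f"{key!r} must map a string key to a string value")
    return json.dumps(fields)


# Placeholder names below are hypothetical; they must match whatever
# placeholders the metric prompt template actually uses.
json_instance = make_json_instance({
    "prompt": "Summarize the article.",
    "baseline_response": "Short summary A.",
    "response": "Short summary B.",
})

# Only one member of the `instance` union may be set at a time.
pairwise_metric_instance = {"json_instance": json_instance}
```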
PairwiseMetricResult
Spec for pairwise metric result.
| Fields | |
|---|---|
| pairwise_choice | Output only. Pairwise metric choice. | 
| explanation | 
 Output only. Explanation for pairwise metric score. | 
| custom_output | Output only. Spec for custom output. | 
PairwiseMetricSpec
Spec for pairwise metric.
| Fields | |
|---|---|
| candidate_response_field_name | 
 Optional. The field name of the candidate response. | 
| baseline_response_field_name | 
 Optional. The field name of the baseline response. | 
| custom_output_format_config | Optional. CustomOutputFormatConfig allows customization of metric output. When this config is set, the default output is replaced with the raw output string. If a custom format is chosen, the  | 
| metric_prompt_template | 
 Required. Metric prompt template for pairwise metric. | 
| system_instruction | 
 Optional. System instructions for pairwise metric. | 
PairwiseQuestionAnsweringQualityInput
Input for pairwise question answering quality metric.
| Fields | |
|---|---|
| metric_spec | 
 Required. Spec for pairwise question answering quality score metric. | 
| instance | 
 Required. Pairwise question answering quality instance. | 
PairwiseQuestionAnsweringQualityInstance
Spec for pairwise question answering quality instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the candidate model. | 
| baseline_prediction | 
 Required. Output of the baseline model. | 
| reference | 
 Optional. Ground truth used to compare against the prediction. | 
| context | 
 Required. Text to answer the question. | 
| instruction | 
 Required. Question Answering prompt for LLM. | 
PairwiseQuestionAnsweringQualityResult
Spec for pairwise question answering quality result.
| Fields | |
|---|---|
| pairwise_choice | Output only. Pairwise question answering prediction choice. | 
| explanation | 
 Output only. Explanation for question answering quality score. | 
| confidence | 
 Output only. Confidence for question answering quality score. | 
PairwiseQuestionAnsweringQualitySpec
Spec for pairwise question answering quality score metric.
| Fields | |
|---|---|
| use_reference | 
 Optional. Whether to use instance.reference to compute question answering quality. | 
| version | 
 Optional. Which version to use for evaluation. | 
PairwiseSummarizationQualityInput
Input for pairwise summarization quality metric.
| Fields | |
|---|---|
| metric_spec | 
 Required. Spec for pairwise summarization quality score metric. | 
| instance | 
 Required. Pairwise summarization quality instance. | 
PairwiseSummarizationQualityInstance
Spec for pairwise summarization quality instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the candidate model. | 
| baseline_prediction | 
 Required. Output of the baseline model. | 
| reference | 
 Optional. Ground truth used to compare against the prediction. | 
| context | 
 Required. Text to be summarized. | 
| instruction | 
 Required. Summarization prompt for LLM. | 
PairwiseSummarizationQualityResult
Spec for pairwise summarization quality result.
| Fields | |
|---|---|
| pairwise_choice | Output only. Pairwise summarization prediction choice. | 
| explanation | 
 Output only. Explanation for summarization quality score. | 
| confidence | 
 Output only. Confidence for summarization quality score. | 
PairwiseSummarizationQualitySpec
Spec for pairwise summarization quality score metric.
| Fields | |
|---|---|
| use_reference | 
 Optional. Whether to use instance.reference to compute pairwise summarization quality. | 
| version | 
 Optional. Which version to use for evaluation. | 
Part
A datatype containing media that is part of a multi-part Content message.
A Part consists of data which has an associated datatype. A Part can only contain one of the accepted types in Part.data.
A Part must have a fixed IANA MIME type identifying the type and subtype of the media if the inline_data or file_data field is filled with raw bytes.
| Fields | |
|---|---|
| thought | 
 Optional. Indicates if the part is thought from the model. | 
| thought_signature | 
 Optional. An opaque signature for the thought so it can be reused in subsequent requests. | 
| Union field data. data can be only one of the following: | |
| text | 
 Optional. Text part (can be code). | 
| inline_data | Optional. Inlined bytes data. | 
| file_data | Optional. URI based data. | 
| function_call | Optional. A predicted FunctionCall returned from the model that contains a string representing the FunctionDeclaration.name with the parameters and their values. | 
| function_response | Optional. The result output of a FunctionCall that contains a string representing the FunctionDeclaration.name and a structured JSON object containing any output from the function call. It is used as context to the model. | 
| executable_code | Optional. Code generated by the model that is meant to be executed. | 
| code_execution_result | Optional. Result of executing the ExecutableCode. | 
| Union field  
 | |
| video_metadata | Optional. Video metadata. The metadata should only be specified while the video data is presented in inline_data or file_data. | 
PartnerModelTuningSpec
Tuning spec for Partner models.
| Fields | |
|---|---|
| training_dataset_uri | 
 Required. Cloud Storage path to file containing training dataset for tuning. The dataset must be formatted as a JSONL file. | 
| validation_dataset_uri | 
 Optional. Cloud Storage path to file containing validation dataset for tuning. The dataset must be formatted as a JSONL file. | 
| hyper_parameters | 
 Hyperparameters for tuning. The accepted hyper_parameters and their valid range of values will differ depending on the base model. | 
PointwiseMetricInput
Input for pointwise metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for pointwise metric. | 
| instance | Required. Pointwise metric instance. | 
PointwiseMetricInstance
Pointwise metric instance. Usually one instance corresponds to one row in an evaluation dataset.
| Fields | |
|---|---|
| Union field instance. Instance for pointwise metric. instance can be only one of the following: | |
| json_instance | 
 Instance specified as a JSON string. String key-value pairs are expected in the json_instance to render PointwiseMetricSpec.instance_prompt_template. | 
| content_map_instance | Key-value contents for the multimodal input, including text, image, video, audio, PDF, etc. The key is the placeholder in the metric prompt template, and the value is the multimodal content. | 
PointwiseMetricResult
Spec for pointwise metric result.
| Fields | |
|---|---|
| explanation | 
 Output only. Explanation for pointwise metric score. | 
| custom_output | Output only. Spec for custom output. | 
| score | 
 Output only. Pointwise metric score. | 
PointwiseMetricSpec
Spec for pointwise metric.
| Fields | |
|---|---|
| custom_output_format_config | Optional. CustomOutputFormatConfig allows customization of metric output. By default, metrics return a score and explanation. When this config is set, the default output is replaced with either:  - The raw output string.  - A parsed output based on a user-defined schema. If a custom format is chosen, the  | 
| metric_prompt_template | 
 Required. Metric prompt template for pointwise metric. | 
| system_instruction | 
 Optional. System instructions for pointwise metric. | 
PreTunedModel
A pre-tuned model for continuous tuning.
| Fields | |
|---|---|
| tuned_model_name | 
 The resource name of the Model. E.g., a model resource name with a specified version id or alias: 
 
 Or, omit the version id to use the default version: 
 | 
| checkpoint_id | 
 Optional. The source checkpoint id. If not specified, the default checkpoint will be used. | 
| base_model | 
 Output only. The name of the base model this  | 
PrebuiltVoiceConfig
The configuration for the prebuilt speaker to use.
| Fields | |
|---|---|
| voice_name | 
 The name of the preset voice to use. | 
PredefinedMetricSpec
The spec for a pre-defined metric.
| Fields | |
|---|---|
| metric_spec_name | 
 Required. The name of a pre-defined metric, such as "instruction_following_v1" or "text_quality_v1". | 
| metric_spec_parameters | Optional. The parameters needed to run the pre-defined metric. | 
PredictLongRunningRequest
Request message for PredictionService.PredictLongRunning.
| Fields | |
|---|---|
| endpoint | 
 Required. The name of the Endpoint requested to serve the prediction. Format:  | 
| instances[] | Required. The instances that are the input to the prediction call. A DeployedModel may have an upper limit on the number of instances it supports per request, and when it is exceeded the prediction call errors in case of AutoML Models, or, in case of customer created Models, the behaviour is as documented by that Model. The schema of any single instance may be specified via Endpoint's DeployedModels'  | 
| parameters | Optional. The parameters that govern the prediction. The schema of the parameters may be specified via Endpoint's DeployedModels'  | 
PredictRequest
Request message for PredictionService.Predict.
| Fields | |
|---|---|
| endpoint | 
 Required. The resource name of the publisher model or endpoint requested to serve the prediction. For Google models like Embedding, Imagen, or Veo, use the publisher model format. For tuned models or other models deployed to a Vertex AI 
 , use the endpoint format. 
 | 
| instances[] | Required. The format of each instance is model-dependent. For Vertex AI Generative AI models, the instance schema can be one of the following types: 
 | 
| parameters | The format of parameters is model-dependent. For Vertex AI Generative AI models, the parameters schema can be one of the following types: 
 | 
| labels | 
 Optional. The user labels for Imagen billing usage only. Only Imagen supports labels. For other use cases, it will be ignored. | 
PredictResponse
Response message for PredictionService.Predict.
| Fields | |
|---|---|
| predictions[] | The format of each prediction is model-dependent. For Vertex AI Generative AI models, the prediction schema can be one of the following types: 
 | 
| deployed_model_id | 
 ID of the Endpoint's DeployedModel that served this prediction. | 
| model | 
 Output only. The resource name of the Model which is deployed as the DeployedModel that this prediction hits. | 
| model_version_id | 
 Output only. The version ID of the Model which is deployed as the DeployedModel that this prediction hits. | 
| model_display_name | 
 Output only. The  | 
| metadata | Output only. Request-level metadata returned by the model. The metadata type will be dependent upon the model implementation. | 
PreferenceOptimizationDataStats
Statistics computed for datasets used for preference optimization.
| Fields | |
|---|---|
| tuning_dataset_example_count | 
 Output only. Number of examples in the tuning dataset. | 
| total_billable_token_count | 
 Output only. Number of billable tokens in the tuning dataset. | 
| tuning_step_count | 
 Output only. Number of tuning steps for this Tuning Job. | 
| user_input_token_distribution | Output only. Dataset distributions for the user input tokens. | 
| user_output_token_distribution | Output only. Dataset distributions for the user output tokens. | 
| score_variance_per_example_distribution | Output only. Dataset distributions for scores variance per example. | 
| scores_distribution | Output only. Dataset distributions for scores. | 
| user_dataset_examples[] | Output only. Sample user examples in the training dataset. | 
| dropped_example_indices[] | 
 Output only. A partial sample of the indices (starting from 1) of the dropped examples. | 
| dropped_example_reasons[] | 
 Output only. For each index in  | 
PreferenceOptimizationHyperParameters
Hyperparameters for Preference Optimization.
| Fields | |
|---|---|
| adapter_size | Optional. Adapter size for preference optimization. | 
| epoch_count | 
 Optional. Number of complete passes the model makes over the entire training dataset during training. | 
| learning_rate_multiplier | 
 Optional. Multiplier for adjusting the default learning rate. | 
| beta | 
 Optional. Weight for KL Divergence regularization. | 
PreferenceOptimizationSpec
Tuning Spec for Preference Optimization.
| Fields | |
|---|---|
| training_dataset_uri | 
 Required. Cloud Storage path to file containing training dataset for preference optimization tuning. The dataset must be formatted as a JSONL file. | 
| hyper_parameters | 
 Optional. Hyperparameters for Preference Optimization. | 
| export_last_checkpoint_only | 
 Optional. If set to true, disable intermediate checkpoints for Preference Optimization and only the last checkpoint will be exported. Otherwise, enable intermediate checkpoints for Preference Optimization. Default is false. | 
| validation_dataset_uri | 
 Optional. Cloud Storage path to file containing validation dataset for preference optimization tuning. The dataset must be formatted as a JSONL file. | 
ProactivityConfig
Config for proactivity features.
| Fields | |
|---|---|
| proactive_audio | 
 Optional. If enabled, the model can decline to respond to the last prompt. For example, this allows the model to ignore out-of-context speech or to stay silent if the user has not made a request yet. | 
PscInterfaceConfig
Configuration for PSC-I.
| Fields | |
|---|---|
| network_attachment | 
 Optional. The name of the Compute Engine network attachment to attach to the resource within the region and user project. To specify this field, you must have already created a network attachment. This field is only used for resources using PSC-I. | 
| dns_peering_configs[] | Optional. DNS peering configurations. When specified, Vertex AI will attempt to configure DNS peering zones in the tenant project VPC to resolve the specified domains using the target network's Cloud DNS. The user must grant the dns.peer role to the Vertex AI Service Agent on the target project. | 
QueryExtensionRequest
Request message for ExtensionExecutionService.QueryExtension.
| Fields | |
|---|---|
| name | 
 Required. Name (identifier) of the extension; Format:  | 
| contents[] | Required. The content of the current conversation with the model. For single-turn queries, this is a single instance. For multi-turn queries, this is a repeated field that contains conversation history + latest request. | 
QueryExtensionResponse
Response message for ExtensionExecutionService.QueryExtension.
| Fields | |
|---|---|
| steps[] | Steps of extension or LLM interaction, can contain function call, function response, or text response. The last step contains the final response to the query. | 
| failure_message | 
 Failure message if any. | 
QueryReasoningEngineRequest
Request message for ReasoningEngineExecutionService.Query.
| Fields | |
|---|---|
| name | 
 Required. The name of the ReasoningEngine resource to use. Format:  | 
| input | Optional. Input content provided by users in JSON object format. Examples include text query, function calling parameters, media bytes, etc. | 
| class_method | 
 Optional. Class method to be used for the query. It is optional and defaults to "query" if unspecified. | 
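The following sketch shows how the optional fields and the "query" default fit together when a request is assembled as a plain dictionary. The helpers `build_query_request` and `effective_class_method` are hypothetical, and the resource name is passed in as an opaque string because its format is not reproduced here.

```python
from typing import Optional

DEFAULT_CLASS_METHOD = "query"  # default stated in the class_method description


def build_query_request(name: str,
                        input_obj: Optional[dict] = None,
                        class_method: Optional[str] = None) -> dict:
    """Build a dict mirroring QueryReasoningEngineRequest; unset optional fields are omitted."""
    request: dict = {"name": name}  # name is the only required field
    if input_obj is not None:
        request["input"] = input_obj  # JSON object, e.g. a text query
    if class_method is not None:
        request["class_method"] = class_method
    return request


def effective_class_method(request: dict) -> str:
    """Return the class method the request resolves to, applying the documented default."""
    return request.get("class_method") or DEFAULT_CLASS_METHOD


# The name below is a placeholder, not a real resource name.
request = build_query_request("<reasoning-engine-resource-name>",
                              input_obj={"question": "What is the weather?"})
```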
QueryReasoningEngineResponse
Response message for ReasoningEngineExecutionService.Query.
| Fields | |
|---|---|
| output | Response provided by users in JSON object format. | 
QuestionAnsweringCorrectnessInput
Input for question answering correctness metric.
| Fields | |
|---|---|
| metric_spec | 
 Required. Spec for question answering correctness score metric. | 
| instance | 
 Required. Question answering correctness instance. | 
QuestionAnsweringCorrectnessInstance
Spec for question answering correctness instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
| reference | 
 Optional. Ground truth used to compare against the prediction. | 
| context | 
 Optional. Text provided as context to answer the question. | 
| instruction | 
 Required. The question asked and other instruction in the inference prompt. | 
QuestionAnsweringCorrectnessResult
Spec for question answering correctness result.
| Fields | |
|---|---|
| explanation | 
 Output only. Explanation for question answering correctness score. | 
| score | 
 Output only. Question Answering Correctness score. | 
| confidence | 
 Output only. Confidence for question answering correctness score. | 
QuestionAnsweringCorrectnessSpec
Spec for question answering correctness metric.
| Fields | |
|---|---|
| use_reference | 
 Optional. Whether to use instance.reference to compute question answering correctness. | 
| version | 
 Optional. Which version to use for evaluation. | 
QuestionAnsweringHelpfulnessInput
Input for question answering helpfulness metric.
| Fields | |
|---|---|
| metric_spec | 
 Required. Spec for question answering helpfulness score metric. | 
| instance | 
 Required. Question answering helpfulness instance. | 
QuestionAnsweringHelpfulnessInstance
Spec for question answering helpfulness instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
| reference | 
 Optional. Ground truth used to compare against the prediction. | 
| context | 
 Optional. Text provided as context to answer the question. | 
| instruction | 
 Required. The question asked and other instruction in the inference prompt. | 
QuestionAnsweringHelpfulnessResult
Spec for question answering helpfulness result.
| Fields | |
|---|---|
| explanation | 
 Output only. Explanation for question answering helpfulness score. | 
| score | 
 Output only. Question Answering Helpfulness score. | 
| confidence | 
 Output only. Confidence for question answering helpfulness score. | 
QuestionAnsweringHelpfulnessSpec
Spec for question answering helpfulness metric.
| Fields | |
|---|---|
| use_reference | 
 Optional. Whether to use instance.reference to compute question answering helpfulness. | 
| version | 
 Optional. Which version to use for evaluation. | 
QuestionAnsweringQualityInput
Input for question answering quality metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for question answering quality score metric. | 
| instance | 
 Required. Question answering quality instance. | 
QuestionAnsweringQualityInstance
Spec for question answering quality instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
| reference | 
 Optional. Ground truth used to compare against the prediction. | 
| context | 
 Required. Text to answer the question. | 
| instruction | 
 Required. Question Answering prompt for LLM. | 
QuestionAnsweringQualityResult
Spec for question answering quality result.
| Fields | |
|---|---|
| explanation | 
 Output only. Explanation for question answering quality score. | 
| score | 
 Output only. Question Answering Quality score. | 
| confidence | 
 Output only. Confidence for question answering quality score. | 
QuestionAnsweringQualitySpec
Spec for question answering quality score metric.
| Fields | |
|---|---|
| use_reference | 
 Optional. Whether to use instance.reference to compute question answering quality. | 
| version | 
 Optional. Which version to use for evaluation. | 
QuestionAnsweringRelevanceInput
Input for question answering relevance metric.
| Fields | |
|---|---|
| metric_spec | 
 Required. Spec for question answering relevance score metric. | 
| instance | 
 Required. Question answering relevance instance. | 
QuestionAnsweringRelevanceInstance
Spec for question answering relevance instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
| reference | 
 Optional. Ground truth used to compare against the prediction. | 
| context | 
 Optional. Text provided as context to answer the question. | 
| instruction | 
 Required. The question asked and other instruction in the inference prompt. | 
QuestionAnsweringRelevanceResult
Spec for question answering relevance result.
| Fields | |
|---|---|
| explanation | 
 Output only. Explanation for question answering relevance score. | 
| score | 
 Output only. Question Answering Relevance score. | 
| confidence | 
 Output only. Confidence for question answering relevance score. | 
QuestionAnsweringRelevanceSpec
Spec for question answering relevance metric.
| Fields | |
|---|---|
| use_reference | 
 Optional. Whether to use instance.reference to compute question answering relevance. | 
| version | 
 Optional. Which version to use for evaluation. | 
RagChunk
A RagChunk includes the content of a chunk of a RagFile, and associated metadata.
| Fields | |
|---|---|
| text | 
 The content of the chunk. | 
| page_span | If populated, represents where the chunk starts and ends in the document. | 
PageSpan
Represents where the chunk starts and ends in the document.
| Fields | |
|---|---|
| first_page | 
 Page where chunk starts in the document. Inclusive. 1-indexed. | 
| last_page | 
 Page where chunk ends in the document. Inclusive. 1-indexed. | 
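Since both bounds are inclusive and 1-indexed, a span covers `last_page - first_page + 1` pages. A small sketch of expanding a span into page numbers; the helper `pages_in_span` is hypothetical and only illustrates the indexing convention.

```python
def pages_in_span(first_page: int, last_page: int) -> list:
    """Expand an inclusive, 1-indexed page span into the page numbers it covers."""
    if first_page < 1:
        raise ValueError("pages are 1-indexed, so first_page must be >= 1")
    if last_page < first_page:
        raise ValueError("last_page must not precede first_page")
    # range() excludes its stop value, so add 1 to keep last_page inclusive.
    return list(range(first_page, last_page + 1))


pages = pages_in_span(3, 5)
```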
RagContexts
Relevant contexts for one query.
| Fields | |
|---|---|
| contexts[] | All its contexts. | 
Context
A context of the query.
| Fields | |
|---|---|
| source_uri | 
 If the file is imported from Cloud Storage or Google Drive, source_uri is the original file URI in Cloud Storage or Google Drive; if the file is uploaded, source_uri is the file display name. | 
| source_display_name | 
 The file display name. | 
| text | 
 The text chunk. | 
| distance | 
 The distance between the query dense embedding vector and the context text vector. | 
| sparse_distance | 
 The distance between the query sparse embedding vector and the context text vector. | 
| chunk | Context of the retrieved chunk. | 
| score | 
 Depending on the underlying Vector DB and the selected metric type, the score can be either the distance or the similarity between the query and the context, and its range depends on the metric type. For example, if the metric type is COSINE_DISTANCE, it represents the distance between the query and the context. The larger the distance, the less relevant the context is to the query. The range is [0, 2], where 0 means the most relevant and 2 means the least relevant. | 
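The [0, 2] range quoted for COSINE_DISTANCE follows if cosine distance is taken to be one minus cosine similarity, which is the usual definition and is assumed here; how the service computes it internally is not described in this reference. A minimal sketch in plain Python:

```python
import math


def cosine_distance(a: list, b: list) -> float:
    """Cosine distance, assumed to be 1 - cosine similarity, so the result lies in [0, 2]."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine distance is undefined for zero vectors")
    return 1.0 - dot / (norm_a * norm_b)


same = cosine_distance([1.0, 0.0], [1.0, 0.0])        # same direction: most relevant
orthogonal = cosine_distance([1.0, 0.0], [0.0, 1.0])  # unrelated directions
opposite = cosine_distance([1.0, 0.0], [-1.0, 0.0])   # opposite direction: least relevant
```

Under this definition, sorting contexts by ascending score puts the most relevant first when the metric type is a distance; the order would be reversed for a similarity metric.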
RagCorpus
A RagCorpus is a RagFile container and a project can have multiple RagCorpora.
| Fields | |
|---|---|
| name | 
 Output only. The resource name of the RagCorpus. | 
| display_name | 
 Required. The display name of the RagCorpus. The name can be up to 128 characters long and can consist of any UTF-8 characters. | 
| description | 
 Optional. The description of the RagCorpus. | 
| rag_embedding_model_config | Optional. Immutable. The embedding model config of the RagCorpus. | 
| rag_vector_db_config | Optional. Immutable. The Vector DB config of the RagCorpus. | 
| create_time | Output only. Timestamp when this RagCorpus was created. | 
| update_time | Output only. Timestamp when this RagCorpus was last updated. | 
| corpus_status | Output only. RagCorpus state. | 
| rag_files_count | 
 Output only. Number of RagFiles in the RagCorpus. NOTE: This field is not populated in the response of  | 
| encryption_spec | Optional. Immutable. The CMEK key name used to encrypt at-rest data related to this Corpus. Only applicable to RagManagedDb option for Vector DB. This field can only be set at corpus creation time, and cannot be updated or deleted. | 
| corpus_type_config | Optional. The corpus type config of the RagCorpus. | 
| Union field backend_config. The backend config of the RagCorpus. It can be data store and/or retrieval engine. backend_config can be only one of the following: | |
| vector_db_config | Optional. Immutable. The config for the Vector DBs. | 
| vertex_ai_search_config | Optional. Immutable. The config for the Vertex AI Search. | 
CorpusTypeConfig
The config for the corpus type of the RagCorpus.
| Fields | |
|---|---|
| Union field corpus_type_config. Optional. Whether the RagCorpus is used as document store or memory store. corpus_type_config can be only one of the following: | |
| document_corpus | Optional. Config for the document corpus. | 
| memory_corpus | Optional. Config for the memory corpus. | 
DocumentCorpus
Config for the document corpus.
This type has no fields.
MemoryCorpus
Config for the memory corpus.
| Fields | |
|---|---|
| llm_parser | The LLM parser to use for the memory corpus. | 
RagEmbeddingModelConfig
Config for the embedding model to use for RAG.
| Fields | |
|---|---|
| Union field model_config. The model config to use. model_config can be only one of the following: | |
| vertex_prediction_endpoint | The Vertex AI Prediction Endpoint that either refers to a publisher model or an endpoint that is hosting a 1P fine-tuned text embedding model. Endpoints hosting non-1P fine-tuned text embedding models are currently not supported. This is used for dense vector search. | 
| hybrid_search_config | Configuration for hybrid search. | 
HybridSearchConfig
Config for hybrid search.
| Fields | |
|---|---|
| sparse_embedding_config | Optional. The configuration for sparse embedding generation. This field is optional; the default behavior depends on the vector database choice on the RagCorpus. | 
| dense_embedding_model_prediction_endpoint | Required. The Vertex AI Prediction Endpoint that hosts the embedding model for dense embedding generations. | 
SparseEmbeddingConfig
Configuration for sparse embedding generation.
| Fields | |
|---|---|
| Union field model. The model to use for sparse embedding generation. model can be only one of the following: | |
| bm25 | Use BM25 scoring algorithm. | 
Bm25
Message for BM25 parameters.
| Fields | |
|---|---|
| multilingual | 
 Optional. Use multilingual tokenizer if set to true. | 
| k1 | 
 Optional. The parameter to control term frequency saturation. It determines the scaling between the matching term frequency and final score. k1 is in the range of [1.2, 3]. The default value is 1.2. | 
| b | 
 Optional. The parameter to control document length normalization. It determines how much the document length affects the final score. b is in the range of [0, 1]. The default value is 0.75. | 
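To make the roles of k1 and b concrete, the sketch below scores a single term with the standard Okapi BM25 formula. This is the textbook form, shown as an assumption: the reference does not state which BM25 variant or IDF definition the service uses, so the IDF is passed in as an input. The function name `bm25_term_score` is hypothetical.

```python
def bm25_term_score(tf: float, doc_len: float, avg_doc_len: float, idf: float,
                    k1: float = 1.2, b: float = 0.75) -> float:
    """Score one query term against one document with the textbook Okapi BM25 formula."""
    if not 1.2 <= k1 <= 3:
        raise ValueError("k1 is documented to be in the range [1.2, 3]")
    if not 0 <= b <= 1:
        raise ValueError("b is documented to be in the range [0, 1]")
    # b controls how strongly document length scales the term frequency:
    # b = 0 ignores length, b = 1 normalizes fully by doc_len / avg_doc_len.
    length_norm = 1.0 - b + b * doc_len / avg_doc_len
    # k1 controls saturation: as tf grows, the score approaches idf * (k1 + 1).
    return idf * tf * (k1 + 1.0) / (tf + k1 * length_norm)


average_doc = bm25_term_score(tf=1, doc_len=100, avg_doc_len=100, idf=1.0)
long_doc = bm25_term_score(tf=1, doc_len=400, avg_doc_len=100, idf=1.0)
```

With the defaults, the same term frequency scores lower in the longer document, and raising k1 lets repeated occurrences of a term keep adding to the score for longer before it levels off.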
VertexPredictionEndpoint
Config representing a model hosted on Vertex Prediction Endpoint.
| Fields | |
|---|---|
| endpoint | 
 Required. The endpoint resource name. Format:  | 
| model | 
 Output only. The resource name of the model that is deployed on the endpoint. Present only when the endpoint is not a publisher model. Pattern:  | 
| model_version_id | 
 Output only. Version ID of the model that is deployed on the endpoint. Present only when the endpoint is not a publisher model. | 
RagEngineConfig
Config for RagEngine.
| Fields | |
|---|---|
| name | 
 Identifier. The name of the RagEngineConfig. Format:  | 
| rag_managed_db_config | The config of the RagManagedDb used by RagEngine. | 
RagFile
A RagFile contains user data for chunking, embedding and indexing.
| Fields | |
|---|---|
| name | 
 Output only. The resource name of the RagFile. | 
| display_name | 
 Required. The display name of the RagFile. The name can be up to 128 characters long and can consist of any UTF-8 characters. | 
| description | 
 Optional. The description of the RagFile. | 
| size_bytes | 
 Output only. The size of the RagFile in bytes. | 
| rag_file_type | Output only. The type of the RagFile. | 
| create_time | Output only. Timestamp when this RagFile was created. | 
| update_time | Output only. Timestamp when this RagFile was last updated. | 
| file_status | Output only. State of the RagFile. | 
| user_metadata | 
 Output only. The metadata for metadata search. The user_metadata needs to be in JSON format. | 
| Union field rag_file_source. The origin location of the RagFile if it is imported from Google Cloud Storage or Google Drive. rag_file_source can be only one of the following: | |
| gcs_source | Output only. Google Cloud Storage location of the RagFile. It does not support wildcards in the Cloud Storage uri for now. | 
| google_drive_source | Output only. Google Drive location. Supports importing individual files as well as Google Drive folders. | 
| direct_upload_source | Output only. The RagFile is encapsulated and uploaded in the UploadRagFile request. | 
| slack_source | The RagFile is imported from a Slack channel. | 
| jira_source | The RagFile is imported from a Jira query. | 
| share_point_sources | The RagFile is imported from a SharePoint source. | 
RagFileType
The type of the RagFile.
| Enums | |
|---|---|
| RAG_FILE_TYPE_UNSPECIFIED | RagFile type is unspecified. | 
| RAG_FILE_TYPE_TXT | RagFile type is TXT. | 
| RAG_FILE_TYPE_PDF | RagFile type is PDF. | 
RagFileChunkingConfig
Specifies the size and overlap of chunks for RagFiles.
| Fields | |
|---|---|
| chunk_size | 
 The size of the chunks. | 
| chunk_overlap | 
 The overlap between chunks. | 
| Union field chunking_config. Specifies the chunking config for RagFiles. chunking_config can be only one of the following: | |
| fixed_length_chunking | Specifies the fixed length chunking config. | 
FixedLengthChunking
Specifies the fixed length chunking config.
| Fields | |
|---|---|
| chunk_size | 
 The size of the chunks. | 
| chunk_overlap | 
 The overlap between chunks. | 
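To illustrate how chunk_size and chunk_overlap interact, here is a minimal Python sketch of fixed-length chunking over a list of tokens. The function name `fixed_length_chunks` and the whitespace tokenization are assumptions made for illustration; this reference does not state how the service tokenizes text or measures chunk size.

```python
from typing import List


def fixed_length_chunks(tokens: List[str], chunk_size: int, chunk_overlap: int) -> List[List[str]]:
    """Split tokens into windows of chunk_size that share chunk_overlap tokens."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must be in [0, chunk_size)")
    step = chunk_size - chunk_overlap  # how far each new chunk advances
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + chunk_size])
        if start + chunk_size >= len(tokens):
            break  # this window already reached the end of the input
    return chunks


# Hypothetical input: whitespace tokenization is an assumption.
tokens = "a b c d e f g h".split()
chunks = fixed_length_chunks(tokens, chunk_size=4, chunk_overlap=2)
```

With a chunk size of 4 and an overlap of 2, each chunk repeats the last two tokens of the previous one, so context that straddles a chunk boundary appears in both neighbors.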
RagFileMetadataConfig
Metadata config for RagFile.
| Fields | |
|---|---|
| Union field metadata_schema_source. Specifies the metadata schema source. metadata_schema_source can be only one of the following: | |
| gcs_metadata_schema_source | Google Cloud Storage location. Supports importing individual files as well as entire Google Cloud Storage directories. Sample formats: -  | 
| google_drive_metadata_schema_source | Google Drive location. Supports importing individual files as well as Google Drive folders. If the user provides a folder, the metadata schema will be read from the files that end with "metadata_schema.json" in the directory. | 
| inline_metadata_schema_source | 
 Inline metadata schema source. Must be a JSON string. | 
| Union field metadata_source. Specifies the metadata source. metadata_source can be only one of the following: | |
| gcs_metadata_source | Google Cloud Storage location. Supports importing individual files as well as entire Google Cloud Storage directories. Sample formats: -  | 
| google_drive_metadata_source | Google Drive location. Supports importing individual files as well as Google Drive folders. If the user provides a directory, the metadata will be read from the files that end with "metadata.json" in the directory. | 
| inline_metadata_source | 
 Inline metadata source. Must be a JSON string. | 
RagFileParsingConfig
Specifies the parsing config for RagFiles.
| Fields | |
|---|---|
| use_advanced_pdf_parsing | 
 Whether to use advanced PDF parsing. | 
| Union field parser. The parser to use for RagFiles. parser can be only one of the following: | |
| advanced_parser | The Advanced Parser to use for RagFiles. | 
| layout_parser | The Layout Parser to use for RagFiles. | 
| llm_parser | The LLM Parser to use for RagFiles. | 
AdvancedParser
Specifies the advanced parsing for RagFiles.
| Fields | |
|---|---|
| use_advanced_pdf_parsing | 
 Whether to use advanced PDF parsing. | 
LayoutParser
Document AI Layout Parser config.
| Fields | |
|---|---|
| processor_name | 
 The full resource name of a Document AI processor or processor version. The processor must have type  | 
| max_parsing_requests_per_min | 
 The maximum number of requests the job is allowed to make to the Document AI processor per minute. Consult https://cloud.google.com/document-ai/quotas and the Quota page for your project to set an appropriate value here. If unspecified, a default value of 120 QPM is used. | 
| global_max_parsing_requests_per_min | 
 The maximum number of requests the job is allowed to make to the Document AI processor per minute in this project. Consult https://cloud.google.com/document-ai/quotas and the Quota page for your project to set an appropriate value here. If this value is not specified, max_parsing_requests_per_min is used by the indexing pipeline as the global limit. | 
LlmParser
Specifies the LLM parsing for RagFiles.
| Fields | |
|---|---|
| model_name | 
 The name of an LLM model used for parsing. Format: *  | 
| max_parsing_requests_per_min | 
 The maximum number of requests the job is allowed to make to the LLM model per minute. Consult https://cloud.google.com/vertex-ai/generative-ai/docs/quotas and your document size to set an appropriate value here. If unspecified, a default value of 5000 QPM is used. | 
| global_max_parsing_requests_per_min | 
 The maximum number of requests the job is allowed to make to the LLM model per minute in this project. Consult https://cloud.google.com/vertex-ai/generative-ai/docs/quotas and your document size to set an appropriate value here. If this value is not specified, max_parsing_requests_per_min is used by the indexing pipeline job as the global limit. | 
| custom_parsing_prompt | 
 The prompt to use for parsing. If not specified, a default prompt will be used. | 
RagFileTransformationConfig
Specifies the transformation config for RagFiles.
| Fields | |
|---|---|
| rag_file_chunking_config | Specifies the chunking config for RagFiles. | 
RagManagedDbConfig
Configuration message for RagManagedDb used by RagEngine.
| Fields | |
|---|---|
| Union field tier. The tier of the RagManagedDb. tier can be only one of the following: | |
| enterprise | Sets the RagManagedDb to the Enterprise tier. | 
| scaled | Sets the RagManagedDb to the Scaled tier. This is the default tier if not explicitly chosen. | 
| basic | Sets the RagManagedDb to the Basic tier. | 
| unprovisioned | Sets the RagManagedDb to the Unprovisioned tier. | 
Basic
This type has no fields.
Basic tier is a cost-effective and low compute tier suitable for the following cases: * Experimenting with RagManagedDb. * Small data size. * Latency insensitive workload. * Only using RAG Engine with external vector DBs.
NOTE: This is the default tier if not explicitly chosen.
Enterprise
This type has no fields.
Enterprise tier offers production grade performance along with autoscaling functionality. It is suitable for customers with large amounts of data or performance sensitive workloads.
Scaled
This type has no fields.
Scaled tier offers production grade performance along with autoscaling functionality. It is suitable for customers with large amounts of data or performance sensitive workloads.
Unprovisioned
This type has no fields.
Disables the RAG Engine service and deletes all your data held within this service. This will halt the billing of the service.
NOTE: Once deleted the data cannot be recovered. To start using RAG Engine again, you will need to update the tier by calling the UpdateRagEngineConfig API.
RagQuery
A query to retrieve relevant contexts.
| Fields | |
|---|---|
| similarity_top_k | 
 Optional. The number of contexts to retrieve. | 
| ranking | Optional. Configurations for hybrid search results ranking. | 
| rag_retrieval_config | Optional. The retrieval config for the query. | 
| Union field query. The query to retrieve contexts. Currently only text query is supported. query can be only one of the following: | |
| text | 
 Optional. The query in text format to get relevant contexts. | 
Ranking
Configurations for hybrid search results ranking.
| Fields | |
|---|---|
| alpha | 
 Optional. Alpha value controls the weight between dense and sparse vector search results. The range is [0, 1], where 0 means sparse vector search only and 1 means dense vector search only. The default value is 0.5, which balances sparse and dense vector search equally. | 
RagRetrievalConfig
Specifies the context retrieval config.
| Fields | |
|---|---|
| top_k | 
 Optional. The number of contexts to retrieve. | 
| hybrid_search | Optional. Config for Hybrid Search. | 
| filter | Optional. Config for filters. | 
| ranking | Optional. Config for ranking and reranking. | 
Filter
Config for filters.
| Fields | |
|---|---|
| metadata_filter | 
 Optional. String for metadata filtering. | 
| Union field vector_db_threshold. Filter contexts retrieved from the vector DB based on either vector distance or vector similarity. vector_db_threshold can be only one of the following: | |
| vector_distance_threshold | 
 Optional. Only returns contexts with vector distance smaller than the threshold. | 
| vector_similarity_threshold | 
 Optional. Only returns contexts with vector similarity larger than the threshold. | 
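The two thresholds work in opposite directions: a distance threshold keeps contexts below the limit, while a similarity threshold keeps contexts above it. The following minimal Python sketch illustrates that behavior. The helper name `apply_vector_db_threshold` and the `distance` and `similarity` dictionary keys are hypothetical and are not part of the API.

```python
from typing import Dict, List, Optional


def apply_vector_db_threshold(
    contexts: List[Dict[str, float]],
    vector_distance_threshold: Optional[float] = None,
    vector_similarity_threshold: Optional[float] = None,
) -> List[Dict[str, float]]:
    """Keep contexts that pass whichever single threshold is set."""
    if vector_distance_threshold is not None and vector_similarity_threshold is not None:
        # vector_db_threshold is a union field, so only one member may be set.
        raise ValueError("set only one of the two thresholds")
    if vector_distance_threshold is not None:
        # Smaller distance means a closer match.
        return [c for c in contexts if c["distance"] < vector_distance_threshold]
    if vector_similarity_threshold is not None:
        # Larger similarity means a closer match.
        return [c for c in contexts if c["similarity"] > vector_similarity_threshold]
    return list(contexts)


# Hypothetical scored contexts for illustration.
sample = [
    {"distance": 0.2, "similarity": 0.8},
    {"distance": 0.6, "similarity": 0.4},
]
close_by_distance = apply_vector_db_threshold(sample, vector_distance_threshold=0.5)
close_by_similarity = apply_vector_db_threshold(sample, vector_similarity_threshold=0.5)
```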
HybridSearch
Config for Hybrid Search.
| Fields | |
|---|---|
| alpha | 
 Optional. Alpha value controls the weight between dense and sparse vector search results. The range is [0, 1], where 0 means sparse vector search only and 1 means dense vector search only. The default value is 0.5, which balances sparse and dense vector search equally. | 
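One way to read alpha is as the weight in a linear blend of the dense and sparse scores. The Python sketch below shows that interpretation only. The function `hybrid_score` is hypothetical, and this reference does not confirm that the service combines scores with exactly this formula.

```python
def hybrid_score(dense_score: float, sparse_score: float, alpha: float = 0.5) -> float:
    """Blend dense and sparse scores; alpha=1 is dense only, alpha=0 is sparse only."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in the range [0, 1]")
    # Assumption: a simple convex combination of the two scores.
    return alpha * dense_score + (1.0 - alpha) * sparse_score


dense_only = hybrid_score(0.8, 0.2, alpha=1.0)
sparse_only = hybrid_score(0.8, 0.2, alpha=0.0)
balanced = hybrid_score(0.8, 0.2)  # uses the documented default of 0.5
```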
Ranking
Config for ranking and reranking.
| Fields | |
|---|---|
| Union field ranking_config. Config options for ranking. Currently only Rank Service is supported. ranking_config can be only one of the following: | |
| rank_service | Optional. Config for Rank Service. | 
| llm_ranker | Optional. Config for LlmRanker. | 
LlmRanker
Config for LlmRanker.
| Fields | |
|---|---|
| model_name | 
 Optional. The model name used for ranking. See Supported models. | 
RankService
Config for Rank Service.
| Fields | |
|---|---|
| model_name | 
 Optional. The model name of the rank service. Format:  | 
RagVectorDbConfig
Config for the Vector DB to use for RAG.
| Fields | |
|---|---|
| api_auth | Authentication config for the chosen Vector DB. | 
| rag_embedding_model_config | Optional. Immutable. The embedding model config of the Vector DB. | 
| Union field vector_db. The config for the Vector DB. vector_db can be only one of the following: | |
| rag_managed_db | The config for the RAG-managed Vector DB. | 
| weaviate | The config for the Weaviate. | 
| pinecone | The config for the Pinecone. | 
| vertex_feature_store | The config for the Vertex Feature Store. | 
| vertex_vector_search | The config for the Vertex Vector Search. | 
Pinecone
The config for the Pinecone.
| Fields | |
|---|---|
| index_name | 
 Pinecone index name. This value cannot be changed after it's set. | 
RagManagedDb
The config for the default RAG-managed Vector DB.
| Fields | |
|---|---|
| Union field retrieval_strategy. Choice of retrieval strategy. retrieval_strategy can be only one of the following: | |
| knn | Performs a KNN search on RagCorpus. Default choice if not specified. | 
| ann | Performs an ANN search on RagCorpus. Use this if you have a lot of files (> 10K) in your RagCorpus and want to reduce the search latency. | 
ANN
Config for ANN search.
RagManagedDb uses a tree-based structure to partition data and facilitate faster searches. As a tradeoff, it requires longer indexing time and manual triggering of index rebuild via the ImportRagFiles and UpdateRagCorpus API.
| Fields | |
|---|---|
| tree_depth | 
 The depth of the tree-based structure. Only depth values of 2 and 3 are supported. The recommended value is 2 if you have O(10K) files in the RagCorpus, and 3 if you have more than that. Default value is 2. | 
| leaf_count | 
 Number of leaf nodes in the tree-based structure. Each leaf node contains groups of closely related vectors along with their corresponding centroid. Recommended value is 10 * sqrt(num of RagFiles in your RagCorpus). Default value is 500. | 
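The recommendations above can be turned into a small helper. This is a rough Python sketch: the function name `recommended_ann_config` is hypothetical, and treating "O(10K)" as a hard cutoff of 10,000 files is an assumption made only to get a concrete rule.

```python
import math
from typing import Dict


def recommended_ann_config(num_rag_files: int) -> Dict[str, int]:
    """Suggest tree_depth and leaf_count from the number of RagFiles."""
    if num_rag_files < 0:
        raise ValueError("num_rag_files must be non-negative")
    # Assumption: "O(10K) files" is read as "up to 10,000 files".
    tree_depth = 2 if num_rag_files <= 10_000 else 3
    # Documented recommendation: 10 * sqrt(number of RagFiles).
    leaf_count = round(10 * math.sqrt(num_rag_files))
    return {"tree_depth": tree_depth, "leaf_count": leaf_count}


small_corpus = recommended_ann_config(10_000)
large_corpus = recommended_ann_config(40_000)
```

For a corpus of 10,000 files this suggests a leaf_count of 1000, which is twice the documented default of 500.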
KNN
This type has no fields.
Config for KNN search.
VertexFeatureStore
The config for the Vertex Feature Store.
| Fields | |
|---|---|
| feature_view_resource_name | 
 The resource name of the FeatureView. Format:  | 
VertexVectorSearch
The config for the Vertex Vector Search.
| Fields | |
|---|---|
| index_endpoint | 
 The resource name of the Index Endpoint. Format:  | 
| index | 
 The resource name of the Index. Format:  | 
Weaviate
The config for the Weaviate.
| Fields | |
|---|---|
| http_endpoint | 
 Weaviate DB instance HTTP endpoint, for example 34.56.78.90:8080. Vertex RAG only supports HTTP connections to Weaviate. This value cannot be changed after it's set. | 
| collection_name | 
 The corresponding collection this corpus maps to. This value cannot be changed after it's set. | 
RawOutput
Raw output.
| Fields | |
|---|---|
| raw_output[] | 
 Output only. Raw output string. | 
RawPredictRequest
Request message for PredictionService.RawPredict.
| Fields | |
|---|---|
| endpoint | 
 Required. The name of the Endpoint requested to serve the prediction. Format:  | 
| http_body | The prediction input. Supports HTTP headers and arbitrary data payload. A  You can specify the schema for each instance in the  | 
RealtimeInputConfig
Configures the realtime input behavior in BidiGenerateContent.
| Fields | |
|---|---|
| automatic_activity_detection | Optional. If not set, automatic activity detection is enabled by default. If automatic voice detection is disabled, the client must send activity signals. | 
| activity_handling | Optional. Defines what effect activity has. | 
| turn_coverage | Optional. Defines which input is included in the user's turn. | 
ActivityHandling
The different ways of handling user activity.
| Enums | |
|---|---|
| ACTIVITY_HANDLING_UNSPECIFIED | If unspecified, the default behavior is START_OF_ACTIVITY_INTERRUPTS. | 
| START_OF_ACTIVITY_INTERRUPTS | The start of activity interrupts the model's response (also called "barge in"). The model's current response is cut off at the moment of the interruption. This is the default behavior. | 
| NO_INTERRUPTION | The model's response will not be interrupted. | 
AutomaticActivityDetection
Configures automatic detection of activity.
| Fields | |
|---|---|
| start_of_speech_sensitivity | Optional. Determines how likely speech is to be detected. | 
| end_of_speech_sensitivity | Optional. Determines how likely detected speech is ended. | 
| prefix_padding_ms | 
 Optional. The required duration of detected speech before start-of-speech is committed. The lower this value the more sensitive the start-of-speech detection is and the shorter speech can be recognized. However, this also increases the probability of false positives. | 
| silence_duration_ms | 
 Optional. The required duration of detected silence (or non-speech) before end-of-speech is committed. The larger this value, the longer speech gaps can be without interrupting the user's activity but this will increase the model's latency. | 
| disabled | 
 Optional. If enabled, detected voice and text input count as activity. If disabled, the client must send activity signals. | 
EndSensitivity
End of speech sensitivity.
| Enums | |
|---|---|
| END_SENSITIVITY_UNSPECIFIED | The default is END_SENSITIVITY_LOW. | 
| END_SENSITIVITY_HIGH | Automatic detection ends speech more often. | 
| END_SENSITIVITY_LOW | Automatic detection ends speech less often. | 
StartSensitivity
Start of speech sensitivity.
| Enums | |
|---|---|
| START_SENSITIVITY_UNSPECIFIED | The default is START_SENSITIVITY_LOW. | 
| START_SENSITIVITY_HIGH | Automatic detection will detect the start of speech more often. | 
| START_SENSITIVITY_LOW | Automatic detection will detect the start of speech less often. | 
TurnCoverage
Options about which input is included in the user's turn.
| Enums | |
|---|---|
| TURN_COVERAGE_UNSPECIFIED | If unspecified, the default behavior is TURN_INCLUDES_ALL_INPUT. | 
| TURN_INCLUDES_ONLY_ACTIVITY | The user's turn only includes activity since the last turn, excluding inactivity (e.g. silence on the audio stream). | 
| TURN_INCLUDES_ALL_INPUT | The user's turn includes all realtime input since the last turn, including inactivity (e.g. silence on the audio stream). This is the default behavior. | 
ReasoningEngine
ReasoningEngine provides a customizable runtime for models to determine which actions to take and in which order.
| Fields | |
|---|---|
| name | 
 Identifier. The resource name of the ReasoningEngine. Format:  | 
| display_name | 
 Required. The display name of the ReasoningEngine. | 
| description | 
 Optional. The description of the ReasoningEngine. | 
| spec | Optional. Configurations of the ReasoningEngine. | 
| create_time | Output only. Timestamp when this ReasoningEngine was created. | 
| update_time | Output only. Timestamp when this ReasoningEngine was most recently updated. | 
| etag | 
 Optional. Used to perform consistent read-modify-write updates. If not set, a blind "overwrite" update happens. | 
| context_spec | Optional. Configuration for how Agent Engine sub-resources should manage context. | 
| encryption_spec | Customer-managed encryption key spec for a ReasoningEngine. If set, this ReasoningEngine and all sub-resources of this ReasoningEngine will be secured by this key. | 
| labels | 
 Labels for the ReasoningEngine. | 
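The etag field supports optimistic concurrency: an update that carries a stale etag is rejected, while an update without an etag overwrites blindly. The following Python sketch illustrates that pattern with an in-memory stand-in. The `InMemoryStore` class and the error type are hypothetical and do not represent the actual service or client library.

```python
import uuid
from typing import Dict


class InMemoryStore:
    """Toy resource store that checks etags on update."""

    def __init__(self, resource: Dict[str, str]) -> None:
        self._resource = dict(resource)
        self._resource["etag"] = uuid.uuid4().hex

    def get(self) -> Dict[str, str]:
        return dict(self._resource)

    def update(self, resource: Dict[str, str]) -> Dict[str, str]:
        sent_etag = resource.get("etag")
        # An empty or missing etag means a blind overwrite.
        if sent_etag and sent_etag != self._resource["etag"]:
            raise RuntimeError("etag mismatch: resource changed since it was read")
        self._resource = dict(resource)
        self._resource["etag"] = uuid.uuid4().hex  # a new version gets a new etag
        return dict(self._resource)


store = InMemoryStore({"display_name": "draft"})
snapshot = store.get()                 # read
snapshot["display_name"] = "reviewed"  # modify
updated = store.update(snapshot)       # write succeeds because the etag is current
```

A second writer that still holds the old snapshot would now fail, which is the signal to re-read the resource and retry.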
ReasoningEngineContextSpec
Configuration for how Agent Engine sub-resources should manage context.
| Fields | |
|---|---|
| memory_bank_config | Optional. Specification for a Memory Bank, which manages memories for the Agent Engine. | 
MemoryBankConfig
Specification for a Memory Bank.
| Fields | |
|---|---|
| generation_config | Optional. Configuration for how to generate memories for the Memory Bank. | 
| similarity_search_config | Optional. Configuration for how to perform similarity search on memories. If not set, the Memory Bank will use the default embedding model  | 
| customization_configs[] | Optional. Configuration for how to customize Memory Bank behavior for a particular scope. | 
| ttl_config | Optional. Configuration for automatic TTL ("time-to-live") of the memories in the Memory Bank. If not set, TTL will not be applied automatically. The TTL can be explicitly set by modifying the  | 
| disable_memory_revisions | 
 If true, no memory revisions will be created for any requests to the Memory Bank. | 
GenerationConfig
Configuration for how to generate memories.
| Fields | |
|---|---|
| model | 
 Required. The model used to generate memories. Format:  | 
SimilaritySearchConfig
Configuration for how to perform similarity search on memories.
| Fields | |
|---|---|
| embedding_model | 
 Required. The model used to generate embeddings to lookup similar memories. Format:  | 
TtlConfig
Configuration for automatically setting the TTL ("time-to-live") of the memories in the Memory Bank.
| Fields | |
|---|---|
| Union field ttl. Configuration for automatically setting the TTL of the memories in the Memory Bank. ttl can be only one of the following: | |
| default_ttl | Optional. The default TTL duration of the memories in the Memory Bank. This applies to all operations that create or update a memory. | 
| granular_ttl_config | Optional. The granular TTL configuration of the memories in the Memory Bank. | 
| Union field memory_revision_ttl. Configuration for automatically setting the TTL of the memory revisions in the Memory Bank. memory_revision_ttl can be only one of the following: | |
| memory_revision_default_ttl | Optional. The default TTL duration of the memory revisions in the Memory Bank. This applies to all operations that create a memory revision. If not set, a default TTL of 365 days will be used. | 
GranularTtlConfig
Configuration for TTL of the memories in the Memory Bank based on the action that created or updated the memory.
| Fields | |
|---|---|
| create_ttl | Optional. The TTL duration for memories uploaded via CreateMemory. | 
| generate_created_ttl | Optional. The TTL duration for memories newly generated via GenerateMemories ( | 
| generate_updated_ttl | Optional. The TTL duration for memories updated via GenerateMemories ( | 
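As an illustration of how a single default TTL differs from the granular configuration, the Python sketch below picks a TTL for a memory based on the action that produced it. The function `resolve_memory_ttl`, the action labels, and the dictionary layout are hypothetical; only the field names come from this reference.

```python
from datetime import timedelta
from typing import Any, Dict, Optional

# Hypothetical action labels mapped to the documented GranularTtlConfig fields.
_FIELD_BY_ACTION = {
    "create": "create_ttl",
    "generate_created": "generate_created_ttl",
    "generate_updated": "generate_updated_ttl",
}


def resolve_memory_ttl(action: str, ttl_config: Dict[str, Any]) -> Optional[timedelta]:
    """Return the TTL that applies to a memory, or None if no TTL applies."""
    if "default_ttl" in ttl_config and "granular_ttl_config" in ttl_config:
        # ttl is a union field, so only one member may be set.
        raise ValueError("set only one of default_ttl or granular_ttl_config")
    if "default_ttl" in ttl_config:
        return ttl_config["default_ttl"]  # applies to every create or update
    granular = ttl_config.get("granular_ttl_config")
    if granular is None:
        return None  # no TTL is applied automatically
    return granular.get(_FIELD_BY_ACTION[action])


granular_config = {
    "granular_ttl_config": {
        "create_ttl": timedelta(days=30),
        "generate_created_ttl": timedelta(days=7),
    }
}
ttl_for_upload = resolve_memory_ttl("create", granular_config)
ttl_for_generated_update = resolve_memory_ttl("generate_updated", granular_config)
```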
ReasoningEngineSpec
ReasoningEngine configurations
| Fields | |
|---|---|
| package_spec | Optional. User provided package spec of the ReasoningEngine. Ignored when users directly specify a deployment image through  | 
| deployment_spec | Optional. The specification of a Reasoning Engine deployment. | 
| class_methods[] | Optional. Declarations for object class methods in OpenAPI specification format. | 
| agent_framework | 
 Optional. The OSS agent framework used to develop the agent. Currently supported values: "google-adk", "langchain", "langgraph", "ag2", "llama-index", "custom". | 
| Union field deployment_source. Defines the source for the deployment. The package_spec field should not be set if deployment_source is specified. deployment_source can be only one of the following: | |
| source_code_spec | Deploy from source code files with a defined entrypoint. | 
| service_account | 
 Optional. The service account that the Reasoning Engine artifact runs as. It should have "roles/storage.objectViewer" for reading the user project's Cloud Storage and "roles/aiplatform.user" for using Vertex extensions. If not specified, the Vertex AI Reasoning Engine Service Agent in the project will be used. | 
DeploymentSpec
The specification of a Reasoning Engine deployment.
| Fields | |
|---|---|
| env[] | Optional. Environment variables to be set with the Reasoning Engine deployment. The environment variables can be updated through the UpdateReasoningEngine API. | 
| secret_env[] | Optional. Environment variables where the value is a secret in Cloud Secret Manager. To use this feature, add 'Secret Manager Secret Accessor' role (roles/secretmanager.secretAccessor) to AI Platform Reasoning Engine Service Agent. | 
| agent_server_mode | The agent server mode. | 
| psc_interface_config | Optional. Configuration for PSC-I. | 
| resource_limits | 
 Optional. Resource limits for each container. Only 'cpu' and 'memory' keys are supported. Defaults to {"cpu": "4", "memory": "4Gi"}. | 
| min_instances | 
 Optional. The minimum number of application instances that will be kept running at all times. Defaults to 1. Range: [0, 10]. | 
| max_instances | 
 Optional. The maximum number of application instances that can be launched to handle increased traffic. Defaults to 100. Range: [1, 1000]. If VPC-SC or PSC-I is enabled, the acceptable range is [1, 100]. | 
| container_concurrency | 
 Optional. Concurrency for each container and agent server. Recommended value: 2 * cpu + 1. Defaults to 9. | 
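The numeric rules in this table can be checked with a few lines of Python. This is a minimal sketch: the helper names are hypothetical, and the `restricted_network` flag is an assumed stand-in for "VPC-SC or PSC-I is enabled".

```python
def recommended_container_concurrency(cpu: int) -> int:
    """Documented recommendation: 2 * cpu + 1."""
    return 2 * cpu + 1


def validate_instance_counts(
    min_instances: int = 1,
    max_instances: int = 100,
    restricted_network: bool = False,
) -> None:
    """Raise ValueError if the counts fall outside the documented ranges."""
    # With VPC-SC or PSC-I the upper bound for max_instances drops to 100.
    max_upper = 100 if restricted_network else 1000
    if not 0 <= min_instances <= 10:
        raise ValueError("min_instances must be in [0, 10]")
    if not 1 <= max_instances <= max_upper:
        raise ValueError(f"max_instances must be in [1, {max_upper}]")


default_concurrency = recommended_container_concurrency(4)  # default cpu limit is "4"
validate_instance_counts()  # the documented defaults pass
```

With the default CPU limit of 4, the recommended formula yields 9, which matches the documented default for container_concurrency.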
AgentServerMode
The agent server mode specifies which features are used when deploying the agent to Agent Engine.
| Enums | |
|---|---|
| AGENT_SERVER_MODE_UNSPECIFIED | Unspecified agent server mode. Do not use. | 
| STABLE | Stable agent server mode. This mode includes all the stable, well-tested features that Agent Engine offers. | 
| EXPERIMENTAL | Experimental agent server mode. This mode contains experimental features. | 
PackageSpec
User-provided package specification, containing pickled object and package requirements.
| Fields | |
|---|---|
| pickle_object_gcs_uri | 
 Optional. The Cloud Storage URI of the pickled python object. | 
| dependency_files_gcs_uri | 
 Optional. The Cloud Storage URI of the dependency files in tar.gz format. | 
| requirements_gcs_uri | 
 Optional. The Cloud Storage URI of the  | 
| python_version | 
 Optional. The Python version. Currently supported versions are 3.8, 3.9, 3.10, and 3.11. If not specified, the default value is 3.10. | 
SourceCodeSpec
Specification for deploying from source code.
| Fields | |
|---|---|
| Union field source. Specifies where the source code is located. source can be only one of the following: | |
| inline_source | Source code is provided directly in the request. | 
| Union field language_spec. Specifies the language-specific configuration for building and running the code. language_spec can be only one of the following: | |
| python_spec | Configuration for a Python application. | 
InlineSource
Specifies source code provided as a byte stream.
| Fields | |
|---|---|
| source_archive | 
 Required. Input only. The application source code archive, provided as a compressed tarball (.tar.gz) file. | 
PythonSpec
Specification for running a Python application from source.
| Fields | |
|---|---|
| version | 
 Optional. The version of Python to use. Supported versions are 3.9, 3.10, 3.11, 3.12, and 3.13. If not specified, the default value is 3.10. | 
| entrypoint_module | 
 Optional. The Python module to load as the entrypoint, specified as a fully qualified module name. For example: path.to.agent. If not specified, defaults to "agent". The project root will be added to Python sys.path, allowing imports to be specified relative to the root. | 
| entrypoint_object | 
 Optional. The name of the callable object within the  | 
| requirements_file | 
 Optional. The path to the requirements file, relative to the source root. If not specified, defaults to "requirements.txt". | 
RebaseTunedModelOperationMetadata
Runtime operation information for GenAiTuningService.RebaseTunedModel.
| Fields | |
|---|---|
| generic_metadata | The common part of the operation generic information. | 
RebaseTunedModelRequest
Request message for GenAiTuningService.RebaseTunedModel.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location into which to rebase the Model. Format:  | 
| tuned_model_ref | Required. TunedModel reference to retrieve the legacy model information. | 
| tuning_job | Optional. The TuningJob to be updated. Users can use this TuningJob field to overwrite tuning configs. | 
| artifact_destination | Optional. The Google Cloud Storage location to write the artifacts. | 
| deploy_to_same_endpoint | 
 Optional. By default, a bison to gemini migration always creates a new model and endpoint, but a gemini-1.0 to gemini-1.5 migration deploys to the same endpoint by default. See details in this Section. | 
Retrieval
Defines a retrieval tool that the model can call to access external knowledge.
| Fields | |
|---|---|
| disable_attribution | 
 Optional. Deprecated. This option is no longer supported. | 
| Union field source. The source of the retrieval. source can be only one of the following: | |
| vertex_ai_search | Set to use data source powered by Vertex AI Search. | 
| vertex_rag_store | Set to use data source powered by Vertex RAG store. User data is uploaded via the VertexRagDataService. | 
| external_api | Use data source powered by external API for grounding. | 
RetrievalConfig
Retrieval config.
| Fields | |
|---|---|
| lat_lng | The location of the user. | 
| language_code | 
 The language code of the user. | 
RetrievalMetadata
Metadata related to retrieval in the grounding flow.
| Fields | |
|---|---|
| google_search_dynamic_retrieval_score | 
 Optional. Score indicating how likely information from Google Search could help answer the prompt. The score is in the range  | 
RetrieveContextsRequest
Request message for VertexRagService.RetrieveContexts.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the Location from which to retrieve RagContexts. The users must have permission to make a call in the project. Format:  | 
| query | Required. Single RAG retrieve query. | 
| Union field data_source. Data Source to retrieve contexts. data_source can be only one of the following: | |
| vertex_rag_store | The data source for Vertex RagStore. | 
VertexRagStore
The data source for Vertex RagStore.
| Fields | |
|---|---|
| rag_corpora[] | 
 Optional. Deprecated. Please use rag_resources to specify the data source. | 
| rag_resources[] | Optional. The representation of the RAG source. It can be used to specify either a corpus only or specific RagFiles. Currently, only one corpus or multiple files from one corpus are supported. Support for multiple corpora may be added in the future. | 
| vector_distance_threshold | 
 Optional. Only return contexts with vector distance smaller than the threshold. | 
RagResource
The definition of the Rag resource.
| Fields | |
|---|---|
| rag_corpus | 
 Optional. RagCorpora resource name. Format:  | 
| rag_file_ids[] | 
 Optional. The IDs of the RagFiles. The files should be in the same corpus that is set in the rag_corpus field. | 
RetrieveContextsResponse
Response message for VertexRagService.RetrieveContexts.
| Fields | |
|---|---|
| contexts | The contexts of the query. | 
RetrieveMemoriesRequest
Request message for MemoryBankService.RetrieveMemories.
| Fields | |
|---|---|
| parent | 
 Required. The resource name of the ReasoningEngine to retrieve memories from. Format:  | 
| scope | 
 Required. The scope of the memories to retrieve. A memory must have exactly the same scope ( | 
| filter | 
 Optional. The standard list filter that will be applied to the retrieved memories. More detail in AIP-160. Supported fields:  *  | 
| Union field retrieval_params. Parameters for retrieval. retrieval_params can be only one of the following: | |
| similarity_search_params | Parameters for semantic similarity search based retrieval. | 
| simple_retrieval_params | Parameters for simple (non-similarity search) retrieval. | 
SimilaritySearchParams
Parameters for semantic similarity search based retrieval.
| Fields | |
|---|---|
| search_query | 
 Required. Query to use for similarity search retrieval. If provided, then the parent ReasoningEngine must have  | 
| top_k | 
 Optional. The maximum number of memories to return. The service may return fewer than this value. If unspecified, at most 3 memories will be returned. The maximum value is 100; values above 100 will be coerced to 100. | 
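The default and the upper bound described for top_k amount to a simple clamping rule, sketched below in Python. The function name `effective_top_k` is hypothetical, and representing "unspecified" as `None` is an assumption; behavior for zero or negative values is not described here and is not modeled.

```python
from typing import Optional

DEFAULT_TOP_K = 3   # used when top_k is unspecified
MAX_TOP_K = 100     # larger values are coerced down to this


def effective_top_k(top_k: Optional[int]) -> int:
    """Return the number of memories the service would return at most."""
    if top_k is None:
        return DEFAULT_TOP_K
    return min(top_k, MAX_TOP_K)


unspecified = effective_top_k(None)
within_range = effective_top_k(25)
coerced = effective_top_k(250)
```

The same default of 3 and maximum of 100 are documented for page_size in SimpleRetrievalParams.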
SimpleRetrievalParams
Parameters for simple (non-similarity search) retrieval.
| Fields | |
|---|---|
| page_size | 
 Optional. The maximum number of memories to return. The service may return fewer than this value. If unspecified, at most 3 memories will be returned. The maximum value is 100; values above 100 will be coerced to 100. | 
| page_token | 
 Optional. A page token, received from a previous  | 
RetrieveMemoriesResponse
Response message for MemoryBankService.RetrieveMemories.
| Fields | |
|---|---|
| retrieved_memories[] | The retrieved memories. | 
| next_page_token | 
 A token that can be sent as  | 
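Paged retrieval with page_token and next_page_token usually follows a simple loop, sketched below in Python. The `fetch_page` callable and the fake backend are hypothetical stand-ins for a real RetrieveMemories call, and the sketch assumes the common convention that an empty next_page_token means there are no more pages.

```python
from typing import Any, Callable, Dict, Iterator, Optional

Page = Dict[str, Any]


def iterate_memories(fetch_page: Callable[[Optional[str]], Page]) -> Iterator[Any]:
    """Yield every retrieved memory by following next_page_token."""
    page_token: Optional[str] = None
    while True:
        response = fetch_page(page_token)
        yield from response.get("retrieved_memories", [])
        page_token = response.get("next_page_token")
        if not page_token:
            break  # assumption: an empty token marks the last page


# Fake backend with two pages, for illustration only.
_FAKE_PAGES: Dict[Optional[str], Page] = {
    None: {"retrieved_memories": ["m1", "m2"], "next_page_token": "t1"},
    "t1": {"retrieved_memories": ["m3"], "next_page_token": ""},
}


def fake_fetch(page_token: Optional[str]) -> Page:
    return _FAKE_PAGES[page_token]


all_memories = list(iterate_memories(fake_fetch))
```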
RetrievedMemory
A retrieved memory.
| Fields | |
|---|---|
| memory | The retrieved Memory. | 
| distance | 
 The distance between the query and the retrieved Memory. Smaller values indicate more similar memories. This is only set if similarity search was used for retrieval. | 
RollbackMemoryOperationMetadata
Details of MemoryBankService.RollbackMemory operation.
| Fields | |
|---|---|
| generic_metadata | The common part of the operation metadata. | 
RollbackMemoryRequest
Request message for MemoryBankService.RollbackMemory.
| Fields | |
|---|---|
| name | 
 Required. The resource name of the Memory to roll back. Format:  | 
| Union field target. The target to roll back to. target can be only one of the following: | |
| target_revision_id | 
 Required. The ID of the revision to roll back to. | 
RollbackMemoryResponse
This type has no fields.
Response message for MemoryBankService.RollbackMemory.
RougeInput
Input for rouge metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for rouge score metric. | 
| instances[] | Required. Repeated rouge instances. | 
RougeInstance
Spec for rouge instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
| reference | 
 Required. Ground truth used to compare against the prediction. | 
RougeMetricValue
Rouge metric value for an instance.
| Fields | |
|---|---|
| score | 
 Output only. Rouge score. | 
RougeResults
Results for rouge metric.
| Fields | |
|---|---|
| rouge_metric_values[] | Output only. Rouge metric values. | 
RougeSpec
Spec for rouge score metric - calculates the recall of n-grams in prediction as compared to reference - returns a score ranging between 0 and 1.
| Fields | |
|---|---|
| rouge_type | 
 Optional. Supported rouge types are rougen[1-9], rougeL, and rougeLsum. | 
| use_stemmer | 
 Optional. Whether to use stemmer to compute rouge score. | 
| split_summaries | 
 Optional. Whether to split summaries while using rougeLsum. | 
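To make "recall of n-grams in prediction as compared to reference" concrete, here is a minimal Python sketch of ROUGE-N recall. The function names, lowercasing, and whitespace tokenization are assumptions for illustration; the service's own tokenization, stemming (use_stemmer), and rougeL variants are not reproduced here.

```python
from collections import Counter
from typing import List


def ngram_counts(tokens: List[str], n: int) -> Counter:
    """Count each n-gram in the token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_recall(prediction: str, reference: str, n: int = 1) -> float:
    """Fraction of reference n-grams that also appear in the prediction."""
    pred = ngram_counts(prediction.lower().split(), n)
    ref = ngram_counts(reference.lower().split(), n)
    total = sum(ref.values())
    if total == 0:
        return 0.0
    # Counter intersection keeps the minimum count of each shared n-gram.
    overlap = sum((pred & ref).values())
    return overlap / total


unigram_recall = rouge_n_recall("the cat sat", "the cat sat on the mat", n=1)
bigram_recall = rouge_n_recall("the cat sat", "the cat sat on the mat", n=2)
```

In this example the prediction covers 3 of the 6 reference unigrams and 2 of the 5 reference bigrams, so the score stays within the documented range of 0 to 1.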
Rubric
Message representing a single testable criterion for evaluation. One input prompt could have multiple rubrics.
| Fields | |
|---|---|
| rubric_id | 
 Unique identifier for the rubric. This ID is used to refer to this rubric, e.g., in RubricVerdict. | 
| content | Required. The actual testable criteria for the rubric. | 
| type | 
 Optional. A type designator for the rubric, which can inform how it's evaluated or interpreted by systems or users. It's recommended to use consistent, well-defined, upper snake_case strings. Examples: "SUMMARIZATION_QUALITY", "SAFETY_HARMFUL_CONTENT", "INSTRUCTION_ADHERENCE". | 
| importance | Optional. The relative importance of this rubric. | 
Content
Content of the rubric, defining the testable criteria.
| Fields | |
|---|---|
| Union field  
 | |
| property | Evaluation criteria based on a specific property. | 
Property
Defines criteria based on a specific property.
| Fields | |
|---|---|
| description | 
 Description of the property being evaluated. Example: "The model's response is grammatically correct." | 
Importance
Importance level of the rubric.
| Enums | |
|---|---|
| IMPORTANCE_UNSPECIFIED | Importance is not specified. | 
| HIGH | High importance. | 
| MEDIUM | Medium importance. | 
| LOW | Low importance. | 
RubricBasedInstructionFollowingInput
Instance and metric spec for RubricBasedInstructionFollowing metric.
| Fields | |
|---|---|
| metric_spec | 
 Required. Spec for RubricBasedInstructionFollowing metric. | 
| instance | 
 Required. Instance for RubricBasedInstructionFollowing metric. | 
RubricBasedInstructionFollowingInstance
Instance for RubricBasedInstructionFollowing metric - one instance corresponds to one row in an evaluation dataset.
| Fields | |
|---|---|
| Union field instance. Instance for RubricBasedInstructionFollowing metric. instance can be only one of the following: | |
| json_instance | 
 Required. Instance specified as a json string. String key-value pairs are expected in the json_instance to render RubricBasedInstructionFollowing prompt templates. | 
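A minimal sketch of building a json_instance value: a flat mapping of string keys to string values, serialized to a JSON string. The key names `prompt` and `response` are hypothetical; the keys that a given prompt template expects are not listed in this reference.

```python
import json

# Hypothetical template variables (assumption): string keys mapped to string values.
fields = {
    "prompt": "Summarize the article in two sentences.",
    "response": "The article covers two topics.",
}

# Guard against non-string values, since string key-value pairs are expected.
if not all(isinstance(k, str) and isinstance(v, str) for k, v in fields.items()):
    raise TypeError("json_instance expects string key-value pairs")

json_instance = json.dumps(fields)
instance = {"json_instance": json_instance}
```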
RubricBasedInstructionFollowingResult
Result for RubricBasedInstructionFollowing metric.
| Fields | |
|---|---|
| rubric_critique_results[] | Output only. List of per rubric critique results. | 
| score | 
 Output only. Overall score for the instruction following. | 
RubricBasedInstructionFollowingSpec
This type has no fields.
Spec for RubricBasedInstructionFollowing metric - returns rubrics and verdicts corresponding to rubrics along with overall score.
RubricCritiqueResult
Rubric critique result.
| Fields | |
|---|---|
| rubric | 
 Output only. Rubric to be evaluated. | 
| verdict | 
 Output only. Verdict for the rubric - true if the rubric is met, false otherwise. | 
RubricGenerationSpec
Specification for how rubrics should be generated.
| Fields | |
|---|---|
| prompt_template | 
 Template for the prompt used to generate rubrics. The details should be updated based on the most-recent recipe requirements. | 
| rubric_content_type | The type of rubric content to be generated. | 
| rubric_type_ontology[] | 
 Optional. An optional, pre-defined list of allowed types for generated rubrics. If this field is provided, it implies  | 
| model_config | Configuration for the model used in rubric generation. Configs including sampling count and base model can be specified here. Flipping is not supported for rubric generation. | 
RubricContentType
Specifies the type of rubric content to generate.
| Enums | |
|---|---|
| RUBRIC_CONTENT_TYPE_UNSPECIFIED | The content type to generate is not specified. | 
| PROPERTY | Generate rubrics based on properties. | 
| NL_QUESTION_ANSWER | Generate rubrics in an NL question answer format. | 
| PYTHON_CODE_ASSERTION | Generate rubrics in a unit test format. | 
RubricGroup
A group of rubrics, used for grouping rubrics based on a metric or a version.
| Fields | |
|---|---|
| group_id | 
 Unique identifier for the group. | 
| display_name | 
 Human-readable name for the group. This should be unique within a given context if used for display or selection. Example: "Instruction Following V1", "Content Quality - Summarization Task". | 
| rubrics[] | Rubrics that are part of this group. | 
RubricVerdict
Represents the verdict of an evaluation against a single rubric.
| Fields | |
|---|---|
| evaluated_rubric | Required. The full rubric definition that was evaluated. Storing this ensures the verdict is self-contained and understandable, especially if the original rubric definition changes or was dynamically generated. | 
| verdict | 
 Required. Outcome of the evaluation against the rubric, represented as a boolean.  | 
| reasoning | 
 Optional. Human-readable reasoning or explanation for the verdict. This can include specific examples or details from the evaluated content that justify the given verdict. | 
RuntimeConfig
Runtime configuration to run the extension.
| Fields | |
|---|---|
| default_params | Optional. Default parameters that will be set for all executions of this extension. If specified, the parameter values can be overridden by values in [[ExecuteExtensionRequest.operation_params]] at request time. The struct should be a map with the param name as the key and the actual param value as the value. For example, if this operation requires a param "name" to be set to "abc", you can set this to something like {"name": "abc"}. | 
| Union field GoogleFirstPartyExtensionConfig. Runtime configurations for Google first party extensions. GoogleFirstPartyExtensionConfig can be only one of the following: | |
| code_interpreter_runtime_config | Code execution runtime configurations for code interpreter extension. | 
| vertex_ai_search_runtime_config | Runtime configuration for Vertex AI Search extension. | 
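The override behavior described for default_params can be pictured as a simple map merge in which request-time values win. This is only a sketch: `resolve_params` is a hypothetical helper, and the service's actual merge logic is not documented here.

```python
def resolve_params(default_params: dict, operation_params: dict) -> dict:
    """Merge two parameter maps; request-time values take precedence over defaults."""
    return {**default_params, **operation_params}


default_params = {"name": "abc"}

# No override at request time: the default is used.
unchanged = resolve_params(default_params, {})

# The request supplies its own value for "name", which replaces the default.
overridden = resolve_params(default_params, {"name": "xyz"})
```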
CodeInterpreterRuntimeConfig
| Fields | |
|---|---|
| file_input_gcs_bucket | 
 Optional. The Cloud Storage bucket for file input of this Extension. If specified, support input from the Cloud Storage bucket. Vertex Extension Custom Code Service Agent should be granted file reader to this bucket. If not specified, the extension will only accept file contents from request body and reject Cloud Storage file inputs. | 
| file_output_gcs_bucket | 
 Optional. The Cloud Storage bucket for file output of this Extension. If specified, write all output files to the Cloud Storage bucket. Vertex Extension Custom Code Service Agent should be granted file writer to this bucket. If not specified, the file content will be output in response body. | 
VertexAISearchRuntimeConfig
| Fields | |
|---|---|
| serving_config_name | 
 Optional. Vertex AI Search serving config name. Format:  | 
| engine_id | 
 Optional. Vertex AI Search engine ID. This is used to construct the search request. By setting this engine_id, API will construct the serving config using the default value to call search API for the user. The engine_id and serving_config_name cannot both be empty at the same time. | 
SafetyInput
Input for safety metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for safety metric. | 
| instance | Required. Safety instance. | 
SafetyInstance
Spec for safety instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
SafetyRating
Safety rating corresponding to the generated content.
| Fields | |
|---|---|
| category | Output only. Harm category. | 
| probability | Output only. Harm probability levels in the content. | 
| probability_score | 
 Output only. Harm probability score. | 
| severity | Output only. Harm severity levels in the content. | 
| severity_score | 
 Output only. Harm severity score. | 
| blocked | 
 Output only. Indicates whether the content was filtered out because of this rating. | 
| overwritten_threshold | Output only. The overwritten threshold for the safety category of Gemini 2.0 image out. If minors are detected in the output image, the threshold of each safety category will be overwritten if the user sets a lower threshold. | 
HarmProbability
Harm probability levels in the content.
| Enums | |
|---|---|
| HARM_PROBABILITY_UNSPECIFIED | Harm probability unspecified. | 
| NEGLIGIBLE | Negligible level of harm. | 
| LOW | Low level of harm. | 
| MEDIUM | Medium level of harm. | 
| HIGH | High level of harm. | 
HarmSeverity
Harm severity levels.
| Enums | |
|---|---|
| HARM_SEVERITY_UNSPECIFIED | Harm severity unspecified. | 
| HARM_SEVERITY_NEGLIGIBLE | Negligible level of harm severity. | 
| HARM_SEVERITY_LOW | Low level of harm severity. | 
| HARM_SEVERITY_MEDIUM | Medium level of harm severity. | 
| HARM_SEVERITY_HIGH | High level of harm severity. | 
SafetyResult
Spec for safety result.
| Fields | |
|---|---|
| explanation | 
 Output only. Explanation for safety score. | 
| score | 
 Output only. Safety score. | 
| confidence | 
 Output only. Confidence for safety score. | 
SafetySetting
Safety settings.
| Fields | |
|---|---|
| category | Required. Harm category. | 
| threshold | Required. The harm block threshold. | 
| method | Optional. Specify if the threshold is used for probability or severity score. If not specified, the threshold is used for probability score. | 
HarmBlockMethod
Probability vs severity.
| Enums | |
|---|---|
| HARM_BLOCK_METHOD_UNSPECIFIED | The harm block method is unspecified. | 
| SEVERITY | The harm block method uses both probability and severity scores. | 
| PROBABILITY | The harm block method uses the probability score. | 
HarmBlockThreshold
Probability-based threshold levels for blocking.
| Enums | |
|---|---|
| HARM_BLOCK_THRESHOLD_UNSPECIFIED | Unspecified harm block threshold. | 
| BLOCK_LOW_AND_ABOVE | Block low threshold and above (i.e. block more). | 
| BLOCK_MEDIUM_AND_ABOVE | Block medium threshold and above. | 
| BLOCK_ONLY_HIGH | Block only high threshold (i.e. block less). | 
| BLOCK_NONE | Block none. | 
| OFF | Turn off the safety filter. | 
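One way to read the threshold names is as a floor on the harm level: content rated at or above the floor is blocked. The sketch below encodes that reading for illustration only. `is_blocked` and both lookup tables are hypothetical, and the real filter may weigh scores and methods differently.

```python
# Harm levels from lowest to highest, mirroring the HarmProbability enum names.
LEVELS = ["NEGLIGIBLE", "LOW", "MEDIUM", "HIGH"]

# Lowest level that each threshold blocks; None means nothing is blocked (assumption).
MIN_BLOCKED_LEVEL = {
    "BLOCK_LOW_AND_ABOVE": "LOW",
    "BLOCK_MEDIUM_AND_ABOVE": "MEDIUM",
    "BLOCK_ONLY_HIGH": "HIGH",
    "BLOCK_NONE": None,
    "OFF": None,
}


def is_blocked(threshold: str, level: str) -> bool:
    """Return True if a rating at `level` meets or exceeds the threshold's floor."""
    floor = MIN_BLOCKED_LEVEL[threshold]
    if floor is None:
        return False
    return LEVELS.index(level) >= LEVELS.index(floor)
```

HARM_BLOCK_THRESHOLD_UNSPECIFIED is deliberately left out of the table because this reference does not say how it is resolved.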
SafetySpec
Spec for safety metric.
| Fields | |
|---|---|
| version | 
 Optional. Which version to use for evaluation. | 
Schema
Schema is used to define the format of input/output data. Represents a select subset of an OpenAPI 3.0 schema object. More fields may be added in the future as needed.
| Fields | |
|---|---|
| type | Optional. The type of the data. | 
| format | 
 Optional. The format of the data. Supported formats: for NUMBER type, "float" and "double"; for INTEGER type, "int32" and "int64"; for STRING type, "email", "byte", etc. | 
| title | 
 Optional. The title of the Schema. | 
| description | 
 Optional. The description of the data. | 
| nullable | 
 Optional. Indicates if the value may be null. | 
| default | Optional. Default value of the data. | 
| items | Optional. SCHEMA FIELDS FOR TYPE ARRAY Schema of the elements of Type.ARRAY. | 
| min_items | 
 Optional. Minimum number of the elements for Type.ARRAY. | 
| max_items | 
 Optional. Maximum number of the elements for Type.ARRAY. | 
| enum[] | 
 Optional. Possible values of the element of primitive type with enum format. Examples: 1. We can define direction as: {type:STRING, format:enum, enum:["EAST", "NORTH", "SOUTH", "WEST"]} 2. We can define apartment number as: {type:INTEGER, format:enum, enum:["101", "201", "301"]} | 
| properties | 
 Optional. SCHEMA FIELDS FOR TYPE OBJECT Properties of Type.OBJECT. | 
| property_ordering[] | 
 Optional. The order of the properties. Not a standard field in open api spec. Only used to support the order of the properties. | 
| required[] | 
 Optional. Required properties of Type.OBJECT. | 
| min_properties | 
 Optional. Minimum number of the properties for Type.OBJECT. | 
| max_properties | 
 Optional. Maximum number of the properties for Type.OBJECT. | 
| minimum | 
 Optional. SCHEMA FIELDS FOR TYPE INTEGER and NUMBER Minimum value of the Type.INTEGER and Type.NUMBER | 
| maximum | 
 Optional. Maximum value of the Type.INTEGER and Type.NUMBER | 
| min_length | 
 Optional. SCHEMA FIELDS FOR TYPE STRING Minimum length of the Type.STRING | 
| max_length | 
 Optional. Maximum length of the Type.STRING | 
| pattern | 
 Optional. Pattern of the Type.STRING to restrict a string to a regular expression. | 
| example | Optional. Example of the object. Will only be populated when the object is the root. | 
| any_of[] | Optional. The value should be validated against any (one or more) of the subschemas in the list. | 
| additional_properties | Optional. Can either be a boolean or an object; controls the presence of additional properties. | 
| ref | 
 Optional. Allows indirect references between schema nodes. The value should be a valid reference to a child of the root  For example, the following schema defines a reference to a schema node named "Pet": type: object properties: pet: ref: #/defs/Pet defs: Pet: type: object properties: name: type: string The value of the "pet" property is a reference to the schema node named "Pet". See details in https://json-schema.org/understanding-json-schema/structuring | 
| defs | 
 Optional. A map of definitions for use by  | 
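The inline example in the ref description is easier to read when laid out as a nested structure. The sketch below writes it as a Python dict and follows the reference with a hypothetical `resolve_ref` helper; it handles only simple "#/a/b" style references and is not the service's resolver.

```python
# The "Pet" example from the ref field description, written as a nested dict.
schema = {
    "type": "object",
    "properties": {"pet": {"ref": "#/defs/Pet"}},
    "defs": {
        "Pet": {
            "type": "object",
            "properties": {"name": {"type": "string"}},
        }
    },
}


def resolve_ref(root: dict, ref: str) -> dict:
    """Follow a '#/a/b' style reference starting from the root schema node."""
    if not ref.startswith("#/"):
        raise ValueError(f"unsupported reference: {ref}")
    node = root
    for key in ref[2:].split("/"):
        node = node[key]
    return node


pet_schema = resolve_ref(schema, schema["properties"]["pet"]["ref"])
```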
SearchEntryPoint
Google search entry point.
| Fields | |
|---|---|
| rendered_content | 
 Optional. Web content snippet that can be embedded in a web page or an app webview. | 
| sdk_blob | 
 Optional. Base64 encoded JSON representing an array of <search term, search url> tuples. | 
SecretEnvVar
Represents an environment variable where the value is a secret in Cloud Secret Manager.
| Fields | |
|---|---|
| name | 
 Required. Name of the secret environment variable. | 
| secret_ref | Required. Reference to a secret stored in the Cloud Secret Manager that will provide the value for this environment variable. | 
SecretRef
Reference to a secret stored in the Cloud Secret Manager that will provide the value for this environment variable.
| Fields | |
|---|---|
| secret | 
 Required. The name of the secret in Cloud Secret Manager. Format: {secret_name}. | 
| version | 
 The Cloud Secret Manager secret version. Can be 'latest' for the latest version, an integer for a specific version, or a version alias. | 
Segment
Segment of the content.
| Fields | |
|---|---|
| part_index | 
 Output only. The index of a Part object within its parent Content object. | 
| start_index | 
 Output only. Start index in the given Part, measured in bytes. Offset from the start of the Part, inclusive, starting at zero. | 
| end_index | 
 Output only. End index in the given Part, measured in bytes. Offset from the start of the Part, exclusive, starting at zero. | 
| text | 
 Output only. The text corresponding to the segment from the response. | 
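Because start_index and end_index are byte offsets rather than character offsets, slicing has to happen on the encoded bytes. The sketch below assumes UTF-8 encoding (the encoding is not stated in this reference), and `segment_text` is a hypothetical helper.

```python
def segment_text(part_text: str, start_index: int, end_index: int) -> str:
    """Slice a Part's text by byte offsets: start inclusive, end exclusive."""
    data = part_text.encode("utf-8")  # UTF-8 is an assumption
    return data[start_index:end_index].decode("utf-8")


text = "café au lait"
# "é" takes two bytes in UTF-8, so "café" spans bytes 0 through 4.
first_word = segment_text(text, 0, 5)
second_word = segment_text(text, 6, 8)
```

With character indexing instead, the same offsets would select "café " and "u ", which is why the byte-based slice matters for non-ASCII text.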
Session
A session contains a set of actions between users and Vertex agents.
| Fields | |
|---|---|
| name | 
 Identifier. The resource name of the session. Format: 'projects/{project}/locations/{location}/reasoningEngines/{reasoning_engine}/sessions/{session}'. | 
| create_time | Output only. Timestamp when the session was created. | 
| update_time | Output only. Timestamp when the session was updated. | 
| display_name | 
 Optional. The display name of the session. | 
| session_state | Optional. Session specific memory which stores key conversation points. | 
| user_id | 
 Required. Immutable. String id provided by the user | 
| Union field expiration. The expiration of the session. expiration can be only one of the following: | |
| expire_time | Optional. Timestamp of when this session is considered expired. This is always provided on output, regardless of what was sent on input. | 
| ttl | Optional. Input only. The TTL for this session. | 
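Since expiration is a union, a caller sets either expire_time or ttl, never both. The sketch below shows one plausible way a ttl could be turned into the expire_time that is returned on output. `resolve_expire_time` is hypothetical, and measuring the ttl from a caller-supplied `now` is an assumption.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional


def resolve_expire_time(
    now: datetime,
    expire_time: Optional[datetime] = None,
    ttl: Optional[timedelta] = None,
) -> Optional[datetime]:
    """Return the effective expiration, rejecting requests that set both fields."""
    if expire_time is not None and ttl is not None:
        raise ValueError("expire_time and ttl are mutually exclusive")
    if ttl is not None:
        return now + ttl  # assumption: ttl counts from `now`
    return expire_time


created = datetime(2025, 1, 1, tzinfo=timezone.utc)
expires = resolve_expire_time(created, ttl=timedelta(hours=1))
```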
SessionEvent
An event represents a message from either the user or agent.
| Fields | |
|---|---|
| name | 
 Identifier. The resource name of the event. Format: | 
| author | 
 Required. The name of the agent that sent the event, or user. | 
| content | Optional. Content of the event provided by the author. | 
| invocation_id | 
 Required. The invocation id of the event. Multiple events can have the same invocation id. | 
| actions | Optional. Actions executed by the agent. | 
| timestamp | Required. Timestamp when the event was created on client side. | 
| error_code | 
 Optional. Error code if the response is an error. Code varies by model. | 
| error_message | 
 Optional. Error message if the response is an error. | 
| event_metadata | Optional. Metadata relating to this event. | 
SessionResumptionConfig
Configuration of session resumption mechanism.
Included in BidiGenerateContentSetup.session_resumption. If included, the server will send SessionResumptionUpdate messages.
| Fields | |
|---|---|
| transparent | 
 Optional. If set, requests the server to send updates with the message_index of the last message sent from the client that is included in the session state. | 
| handle | 
 Session resumption handle of the previous session (the session to restore). If not present, a new session will be started. | 
SessionResumptionUpdate
Update of the session resumption state.
Only sent if BidiGenerateContentSetup.session_resumption was set.
| Fields | |
|---|---|
| new_handle | 
 New handle that represents state that can be resumed. Empty if  | 
| resumable | 
 True if the session can be resumed at this point. Resuming might not be possible at some points. In that case, the server sends an update with an empty new_handle and resumable=false. Examples include the model executing function calls or generating. Resuming the session (using a previous session token) in such a state will result in some data loss. | 
| last_consumed_client_message_index | 
 Index of the last message sent by the client that is included in the state represented by this SessionResumptionToken. Only sent when  Presence of this index allows users to transparently reconnect and avoid losing part of the realtime audio input/video. If the client wishes to temporarily disconnect (for example, as a result of receiving GoAway), they can do so without losing state by buffering messages sent since the last  It will not be used for 'resumption to restore state' some time later -- in those cases partial audio and video frames are likely not needed. | 
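The buffering approach mentioned for last_consumed_client_message_index can be sketched as a small client-side buffer that drops messages once the server reports them as consumed. The `ResendBuffer` class is hypothetical, and numbering messages from 0 in send order is an assumption.

```python
class ResendBuffer:
    """Keeps sent messages until the server reports them as part of its state."""

    def __init__(self) -> None:
        self._pending = []  # list of (message_index, payload)
        self._next_index = 0  # assumption: indices count sent messages from 0

    def record_sent(self, payload) -> int:
        """Remember a message that was just sent and return its index."""
        index = self._next_index
        self._pending.append((index, payload))
        self._next_index += 1
        return index

    def on_update(self, last_consumed_client_message_index: int) -> None:
        """Drop every message the server has already folded into its state."""
        self._pending = [
            (i, p) for i, p in self._pending
            if i > last_consumed_client_message_index
        ]

    def to_resend(self) -> list:
        """Payloads to replay, in order, after reconnecting with the new handle."""
        return [p for _, p in self._pending]


buffer = ResendBuffer()
for chunk in ["audio-0", "audio-1", "audio-2"]:
    buffer.record_sent(chunk)
buffer.on_update(last_consumed_client_message_index=1)
replay = buffer.to_resend()
```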
SlackSource
The Slack source for the ImportRagFilesRequest.
| Fields | |
|---|---|
| channels[] | Required. The Slack channels. | 
SlackChannels
SlackChannels contains the Slack channels and corresponding access token.
| Fields | |
|---|---|
| channels[] | Required. The Slack channel IDs. | 
| api_key_config | Required. The SecretManager secret version resource name (e.g. projects/{project}/secrets/{secret}/versions/{version}) storing the Slack channel access token that has access to the Slack channel IDs. See: https://api.slack.com/tutorials/tracks/getting-a-token. | 
SlackChannel
SlackChannel contains the Slack channel ID and the time range to import.
| Fields | |
|---|---|
| channel_id | 
 Required. The Slack channel ID. | 
| start_time | Optional. The starting timestamp for messages to import. | 
| end_time | Optional. The ending timestamp for messages to import. | 
SpeakerVoiceConfig
Configuration for a single speaker in a multi-speaker setup.
| Fields | |
|---|---|
| speaker | 
 Required. The name of the speaker. This should be the same as the speaker name used in the prompt. | 
| voice_config | Required. The configuration for the voice of this speaker. | 
SpeechConfig
The speech generation config.
| Fields | |
|---|---|
| voice_config | The configuration for the speaker to use. | 
| language_code | 
 Optional. Language code (ISO 639, e.g. en-US) for speech synthesis. | 
| multi_speaker_voice_config | The configuration for a multi-speaker text-to-speech request. This field is mutually exclusive with  | 
StreamDirectPredictRequest
Request message for PredictionService.StreamDirectPredict.
The first message must contain the endpoint field and optionally inputs. Subsequent messages must contain inputs.
| Fields | |
|---|---|
| endpoint | 
 Required. The name of the Endpoint requested to serve the prediction. Format:  | 
| inputs[] | Optional. The prediction input. | 
| parameters | Optional. The parameters that govern the prediction. | 
StreamDirectPredictResponse
Response message for PredictionService.StreamDirectPredict.
| Fields | |
|---|---|
| outputs[] | The prediction output. | 
| parameters | The parameters that govern the prediction. | 
StreamDirectRawPredictRequest
Request message for PredictionService.StreamDirectRawPredict.
The first message must contain the endpoint and method_name fields and optionally input. Subsequent messages must contain input. method_name in subsequent messages has no effect.
| Fields | |
|---|---|
| endpoint | 
 Required. The name of the Endpoint requested to serve the prediction. Format:  | 
| method_name | 
 Optional. Fully qualified name of the API method being invoked to perform predictions. Format:  | 
| input | 
 Optional. The prediction input. | 
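The message ordering described above (routing fields on the first message only, payloads on every message) can be sketched as a generator. Messages are modeled as plain dicts here rather than a real client library's request type, and `build_requests`, `ENDPOINT_NAME`, and `METHOD_NAME` are placeholders, not real values.

```python
from typing import Iterable, Iterator


def build_requests(
    endpoint: str, method_name: str, inputs: Iterable[bytes]
) -> Iterator[dict]:
    """Yield stream messages: routing fields first, then input-only messages."""
    iterator = iter(inputs)
    first = {"endpoint": endpoint, "method_name": method_name}
    head = next(iterator, None)
    if head is not None:
        first["input"] = head  # input is optional on the first message
    yield first
    for payload in iterator:
        # Later messages carry only input; method_name would have no effect.
        yield {"input": payload}


messages = list(build_requests("ENDPOINT_NAME", "METHOD_NAME", [b"a", b"b", b"c"]))
```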
StreamDirectRawPredictResponse
Response message for PredictionService.StreamDirectRawPredict.
| Fields | |
|---|---|
| output | 
 The prediction output. | 
StreamQueryReasoningEngineRequest
Request message for ReasoningEngineExecutionService.StreamQuery.
| Fields | |
|---|---|
| name | 
 Required. The name of the ReasoningEngine resource to use. Format:  | 
| input | Optional. Input content provided by users in JSON object format. Examples include text query, function calling parameters, media bytes, etc. | 
| class_method | 
 Optional. Class method to be used for the stream query. It is optional and defaults to "stream_query" if unspecified. | 
StreamRawPredictRequest
Request message for PredictionService.StreamRawPredict.
| Fields | |
|---|---|
| endpoint | 
 Required. The name of the Endpoint requested to serve the prediction. Format:  | 
| http_body | The prediction input. Supports HTTP headers and arbitrary data payload. | 
StreamingPredictRequest
Request message for PredictionService.StreamingPredict.
The first message must contain the endpoint field and optionally inputs. Subsequent messages must contain inputs.
| Fields | |
|---|---|
| endpoint | 
 Required. The name of the Endpoint requested to serve the prediction. Format:  | 
| inputs[] | The prediction input. | 
| parameters | The parameters that govern the prediction. | 
StreamingPredictResponse
Response message for PredictionService.StreamingPredict.
| Fields | |
|---|---|
| outputs[] | The prediction output. | 
| parameters | The parameters that govern the prediction. | 
StreamingRawPredictRequest
Request message for PredictionService.StreamingRawPredict.
The first message must contain the endpoint and method_name fields and optionally input. Subsequent messages must contain input. method_name in subsequent messages has no effect.
| Fields | |
|---|---|
| endpoint | 
 Required. The name of the Endpoint requested to serve the prediction. Format:  | 
| method_name | 
 Fully qualified name of the API method being invoked to perform predictions. Format:  | 
| input | 
 The prediction input. | 
StreamingRawPredictResponse
Response message for PredictionService.StreamingRawPredict.
| Fields | |
|---|---|
| output | 
 The prediction output. | 
SummarizationHelpfulnessInput
Input for summarization helpfulness metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for summarization helpfulness score metric. | 
| instance | 
 Required. Summarization helpfulness instance. | 
SummarizationHelpfulnessInstance
Spec for summarization helpfulness instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
| reference | 
 Optional. Ground truth used to compare against the prediction. | 
| context | 
 Required. Text to be summarized. | 
| instruction | 
 Optional. Summarization prompt for LLM. | 
SummarizationHelpfulnessResult
Spec for summarization helpfulness result.
| Fields | |
|---|---|
| explanation | 
 Output only. Explanation for summarization helpfulness score. | 
| score | 
 Output only. Summarization Helpfulness score. | 
| confidence | 
 Output only. Confidence for summarization helpfulness score. | 
SummarizationHelpfulnessSpec
Spec for summarization helpfulness score metric.
| Fields | |
|---|---|
| use_reference | 
 Optional. Whether to use instance.reference to compute summarization helpfulness. | 
| version | 
 Optional. Which version to use for evaluation. | 
SummarizationQualityInput
Input for summarization quality metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for summarization quality score metric. | 
| instance | Required. Summarization quality instance. | 
SummarizationQualityInstance
Spec for summarization quality instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
| reference | 
 Optional. Ground truth used to compare against the prediction. | 
| context | 
 Required. Text to be summarized. | 
| instruction | 
 Required. Summarization prompt for LLM. | 
SummarizationQualityResult
Spec for summarization quality result.
| Fields | |
|---|---|
| explanation | 
 Output only. Explanation for summarization quality score. | 
| score | 
 Output only. Summarization Quality score. | 
| confidence | 
 Output only. Confidence for summarization quality score. | 
SummarizationQualitySpec
Spec for summarization quality score metric.
| Fields | |
|---|---|
| use_reference | 
 Optional. Whether to use instance.reference to compute summarization quality. | 
| version | 
 Optional. Which version to use for evaluation. | 
SummarizationVerbosityInput
Input for summarization verbosity metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for summarization verbosity score metric. | 
| instance | 
 Required. Summarization verbosity instance. | 
SummarizationVerbosityInstance
Spec for summarization verbosity instance.
| Fields | |
|---|---|
| prediction | 
 Required. Output of the evaluated model. | 
| reference | 
 Optional. Ground truth used to compare against the prediction. | 
| context | 
 Required. Text to be summarized. | 
| instruction | 
 Optional. Summarization prompt for LLM. | 
SummarizationVerbosityResult
Spec for summarization verbosity result.
| Fields | |
|---|---|
| explanation | 
 Output only. Explanation for summarization verbosity score. | 
| score | 
 Output only. Summarization Verbosity score. | 
| confidence | 
 Output only. Confidence for summarization verbosity score. | 
SummarizationVerbositySpec
Spec for summarization verbosity score metric.
| Fields | |
|---|---|
| use_reference | 
 Optional. Whether to use instance.reference to compute summarization verbosity. | 
| version | 
 Optional. Which version to use for evaluation. | 
SummaryMetrics
The summary metrics for the evaluation run.
| Fields | |
|---|---|
| metrics | 
 Optional. Map of metric name to metric value. | 
| total_items | 
 Optional. The total number of items that were evaluated. | 
| failed_items | 
 Optional. The number of items that failed to be evaluated. | 
SupervisedHyperParameters
Hyperparameters for SFT.
| Fields | |
|---|---|
| epoch_count | 
 Optional. Number of complete passes the model makes over the entire training dataset during training. | 
| learning_rate_multiplier | 
 Optional. Multiplier for adjusting the default learning rate. Mutually exclusive with  | 
| learning_rate | 
 Optional. Learning rate for tuning. Mutually exclusive with  | 
| adapter_size | Optional. Adapter size for tuning. | 
| batch_size | 
 Optional. Batch size for tuning. This feature is only available for open source models. | 
AdapterSize
Supported adapter sizes for tuning.
| Enums | |
|---|---|
| ADAPTER_SIZE_UNSPECIFIED | Adapter size is unspecified. | 
| ADAPTER_SIZE_ONE | Adapter size 1. | 
| ADAPTER_SIZE_TWO | Adapter size 2. | 
| ADAPTER_SIZE_FOUR | Adapter size 4. | 
| ADAPTER_SIZE_EIGHT | Adapter size 8. | 
| ADAPTER_SIZE_SIXTEEN | Adapter size 16. | 
| ADAPTER_SIZE_THIRTY_TWO | Adapter size 32. | 
SupervisedTuningDataStats
Tuning data statistics for Supervised Tuning.
| Fields | |
|---|---|
| tuning_dataset_example_count | 
 Output only. Number of examples in the tuning dataset. | 
| total_tuning_character_count | 
 Output only. Number of tuning characters in the tuning dataset. | 
| total_billable_character_count | 
 Output only. Number of billable characters in the tuning dataset. | 
| total_billable_token_count | 
 Output only. Number of billable tokens in the tuning dataset. | 
| tuning_step_count | 
 Output only. Number of tuning steps for this Tuning Job. | 
| user_input_token_distribution | 
 Output only. Dataset distributions for the user input tokens. | 
| user_output_token_distribution | 
 Output only. Dataset distributions for the user output tokens. | 
| user_message_per_example_distribution | 
 Output only. Dataset distributions for the messages per example. | 
| user_dataset_examples[] | Output only. Sample user messages in the training dataset uri. | 
| total_truncated_example_count | 
 Output only. The number of examples in the dataset that have been dropped. An example can be dropped for reasons including: too many tokens, contains an invalid image, contains too many images, etc. | 
| truncated_example_indices[] | 
 Output only. A partial sample of the indices (starting from 1) of the dropped examples. | 
| dropped_example_reasons[] | 
 Output only. For each index in  | 
SupervisedTuningDatasetDistribution
Dataset distribution for Supervised Tuning.
| Fields | |
|---|---|
| sum | 
 Output only. Sum of a given population of values. | 
| billable_sum | 
 Output only. Sum of a given population of values that are billable. | 
| min | 
 Output only. The minimum of the population values. | 
| max | 
 Output only. The maximum of the population values. | 
| mean | 
 Output only. The arithmetic mean of the values in the population. | 
| median | 
 Output only. The median of the values in the population. | 
| p5 | 
 Output only. The 5th percentile of the values in the population. | 
| p95 | 
 Output only. The 95th percentile of the values in the population. | 
| buckets[] | Output only. Defines the histogram bucket. | 
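To make the distribution fields concrete, the sketch below computes them for a small list of per-example token counts. The nearest-rank percentile is an assumption, since this reference does not say which percentile method is used. billable_sum is omitted because it depends on billing rules not described here, and `summarize` and `percentile` are hypothetical helpers.

```python
import math
import statistics


def percentile(sorted_values: list, pct: float):
    """Nearest-rank percentile over an ascending, non-empty list (assumption)."""
    rank = max(1, math.ceil(pct / 100 * len(sorted_values)))
    return sorted_values[rank - 1]


def summarize(values: list) -> dict:
    """Compute summary fields named after the distribution message's fields."""
    ordered = sorted(values)
    return {
        "sum": sum(ordered),
        "min": ordered[0],
        "max": ordered[-1],
        "mean": statistics.mean(ordered),
        "median": statistics.median(ordered),
        "p5": percentile(ordered, 5),
        "p95": percentile(ordered, 95),
    }


stats = summarize([120, 80, 200, 150, 95])
```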
DatasetBucket
Dataset bucket used to create a histogram for the distribution given a population of values.
| Fields | |
|---|---|
| count | 
 Output only. Number of values in the bucket. | 
| left | 
 Output only. Left bound of the bucket. | 
| right | 
 Output only. Right bound of the bucket. | 
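A histogram made of count/left/right buckets could be built as follows. Equal-width buckets with an inclusive left bound and an exclusive right bound are assumptions made for this sketch, and `build_buckets` is a hypothetical helper; the service's bucketing scheme is not specified in this reference.

```python
def build_buckets(values: list, bucket_width: int) -> list:
    """Group values into equal-width buckets described by count, left, and right."""
    counts = {}
    for value in values:
        left = (value // bucket_width) * bucket_width
        counts[left] = counts.get(left, 0) + 1
    return [
        {"count": count, "left": left, "right": left + bucket_width}
        for left, count in sorted(counts.items())
    ]


buckets = build_buckets([80, 95, 120, 150, 200], bucket_width=50)
```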
SupervisedTuningSpec
Tuning Spec for Supervised Tuning for first party models.
| Fields | |
|---|---|
| training_dataset_uri | 
 Required. Training dataset used for tuning. The dataset can be specified as either a Cloud Storage path to a JSONL file or as the resource name of a Vertex Multimodal Dataset. | 
| validation_dataset_uri | 
 Optional. Validation dataset used for tuning. The dataset can be specified as either a Cloud Storage path to a JSONL file or as the resource name of a Vertex Multimodal Dataset. | 
| hyper_parameters | Optional. Hyperparameters for SFT. | 
| export_last_checkpoint_only | 
 Optional. If set to true, disable intermediate checkpoints for SFT and only the last checkpoint will be exported. Otherwise, enable intermediate checkpoints for SFT. Default is false. | 
| evaluation_config | Optional. Evaluation Config for Tuning Job. | 
| tuning_mode | Tuning mode. | 
TuningMode
Supported tuning modes.
| Enums | |
|---|---|
| TUNING_MODE_UNSPECIFIED | Tuning mode is unspecified. | 
| TUNING_MODE_FULL | Full fine-tuning mode. | 
| TUNING_MODE_PEFT_ADAPTER | PEFT adapter tuning mode. | 
SyntheticExample
Represents a single synthetic example, composed of multiple fields. Used for providing few-shot examples in the request and for returning generated examples in the response.
| Fields | |
|---|---|
| fields[] | Required. A list of fields that constitute an example. | 
SyntheticField
Represents a single named field within a SyntheticExample.
| Fields | |
|---|---|
| field_name | 
 Optional. The name of the field. | 
| content | Required. The content of the field. | 
TaskDescriptionStrategy
Defines a generation strategy based on a high-level task description.
| Fields | |
|---|---|
| task_description | 
 Required. A high-level description of the synthetic data to be generated. | 
Tensor
A tensor value type.
| Fields | |
|---|---|
| dtype | The data type of tensor. | 
| shape[] | 
 Shape of the tensor. | 
| bool_val[] | 
 Type specific representations that make it easy to create tensor protos in all languages. Only the representation corresponding to "dtype" can be set. The values hold the flattened representation of the tensor in row major order. | 
| string_val[] | 
 | 
| bytes_val[] | 
 | 
| float_val[] | 
 | 
| double_val[] | 
 | 
| int_val[] | 
 | 
| int64_val[] | 
 | 
| uint_val[] | 
 | 
| uint64_val[] | 
 | 
| list_val[] | A list of tensor values. | 
| struct_val | 
 A map of string to tensor. | 
| tensor_val | 
 Serialized raw tensor content. | 
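The typed value fields hold the tensor flattened in row-major order, with shape carrying the dimensions. The sketch below flattens a regular nested list that way and assembles a dict using the field names from the table above. `flatten_row_major` is a hypothetical helper, and pairing dtype INT32 with int_val is an assumption.

```python
def flatten_row_major(nested: list) -> tuple:
    """Return (shape, flat_values) for a regular (non-ragged) nested list."""
    shape = []
    level = nested
    while isinstance(level, list):
        shape.append(len(level))
        level = level[0] if level else None

    flat = []

    def walk(node) -> None:
        # Visiting children in order makes the last dimension vary fastest.
        if isinstance(node, list):
            for child in node:
                walk(child)
        else:
            flat.append(node)

    walk(nested)
    return shape, flat


shape, values = flatten_row_major([[1, 2, 3], [4, 5, 6]])
tensor = {"dtype": "INT32", "shape": shape, "int_val": values}
```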
DataType
Data type of the tensor.
| Enums | |
|---|---|
| DATA_TYPE_UNSPECIFIED | Not a legal value for DataType. Used to indicate a DataType field has not been set. | 
| BOOL | Data types that all computation devices are expected to support. | 
| STRING | |
| FLOAT | |
| DOUBLE | |
| INT8 | |
| INT16 | |
| INT32 | |
| INT64 | |
| UINT8 | |
| UINT16 | |
| UINT32 | |
| UINT64 | |
TokensInfo
Tokens info with a list of tokens and the corresponding list of token ids.
| Fields | |
|---|---|
| tokens[] | A list of tokens from the input. |
| token_ids[] | A list of token ids from the input. |
| role | Optional. The role from the corresponding Content. |
Tool
Tool details that the model may use to generate response.
A Tool is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside the knowledge and scope of the model. A Tool object should contain exactly one type of Tool (e.g., FunctionDeclaration, Retrieval or GoogleSearchRetrieval).
| Fields | |
|---|---|
| function_declarations[] | Optional. Function tool type. One or more function declarations to be passed to the model along with the current user query. Model may decide to call a subset of these functions by populating  | 
| retrieval | Optional. Retrieval tool type. System will always execute the provided retrieval tool(s) to get external knowledge to answer the prompt. Retrieval results are presented to the model for generation. | 
| google_search | Optional. GoogleSearch tool type. Tool to support Google Search in Model. Powered by Google. | 
| google_search_retrieval | Optional. Specialized retrieval tool that is powered by Google Search. | 
| google_maps | Optional. GoogleMaps tool type. Tool to support Google Maps in Model. | 
| enterprise_web_search | Optional. Tool to support searching public web data, powered by Vertex AI Search and Sec4 compliance. | 
| code_execution | Optional. CodeExecution tool type. Enables the model to execute code as part of generation. | 
| url_context | Optional. Tool to support URL context retrieval. | 
| computer_use | Optional. Tool to support the model interacting directly with the computer. If enabled, it automatically populates computer-use specific Function Declarations. | 
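
The "exactly one type of Tool" guidance above can be checked on the client side before a request is sent. This is a minimal sketch that treats a Tool as a plain dict keyed by the field names in the table; the helper names are hypothetical, and the reference does not state how the service itself reacts to a Tool with several types set.

```python
TOOL_TYPE_FIELDS = (
    "function_declarations",
    "retrieval",
    "google_search",
    "google_search_retrieval",
    "google_maps",
    "enterprise_web_search",
    "code_execution",
    "url_context",
    "computer_use",
)


def populated_tool_types(tool: dict) -> list:
    """List the tool-type fields that are set on a dict-shaped Tool.

    An empty dict counts as set, because types such as CodeExecution
    have no fields; None and an empty list count as unset.
    """
    return [
        name
        for name in TOOL_TYPE_FIELDS
        if name in tool and tool[name] is not None and tool[name] != []
    ]


def check_single_tool_type(tool: dict) -> str:
    """Return the single populated tool type, or raise ValueError."""
    populated = populated_tool_types(tool)
    if len(populated) != 1:
        raise ValueError(f"expected exactly one tool type, found {populated}")
    return populated[0]


tool_type = check_single_tool_type({"code_execution": {}})
```
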
CodeExecution
This type has no fields.
Tool that executes code generated by the model, and automatically returns the result to the model.
See also [ExecutableCode] and [CodeExecutionResult], which are the input and output of this tool.
ComputerUse
Tool to support computer use.
| Fields | |
|---|---|
| environment | Required. The environment being operated. | 
| excluded_predefined_functions[] | Optional. By default, predefined functions are included in the final model call. Some of them can be explicitly excluded from being automatically included. This can serve two purposes: (1) using a more restricted or different action space; (2) improving the definitions or instructions of predefined functions. |
Environment
Represents the environment being operated, such as a web browser.
| Enums | |
|---|---|
| ENVIRONMENT_UNSPECIFIED | Defaults to browser. | 
| ENVIRONMENT_BROWSER | Operates in a web browser. | 
GoogleSearch
GoogleSearch tool type. Tool to support Google Search in Model. Powered by Google.
| Fields | |
|---|---|
| exclude_domains[] | Optional. List of domains to be excluded from the search results. The default limit is 2000 domains. Example: ["amazon.com", "facebook.com"]. |
| blocking_confidence | Optional. Sites with the chosen confidence level or above will be blocked from the search results. |
PhishBlockThreshold
The confidence levels a user can set to block malicious URLs. URLs at the chosen confidence level and above are blocked. For the meaning of the different Web Risk confidence levels, refer to https://cloud.google.com/web-risk/docs/reference/rpc/google.cloud.webrisk.v1eap1#confidencelevel
| Enums | |
|---|---|
| PHISH_BLOCK_THRESHOLD_UNSPECIFIED | Defaults to unspecified. | 
| BLOCK_LOW_AND_ABOVE | Blocks risky URLs with Low confidence and above. | 
| BLOCK_MEDIUM_AND_ABOVE | Blocks risky URLs with Medium confidence and above. | 
| BLOCK_HIGH_AND_ABOVE | Blocks risky URLs with High confidence and above. | 
| BLOCK_HIGHER_AND_ABOVE | Blocks risky URLs with Higher confidence and above. | 
| BLOCK_VERY_HIGH_AND_ABOVE | Blocks risky URLs with Very high confidence and above. | 
| BLOCK_ONLY_EXTREMELY_HIGH | Blocks only risky URLs with Extremely high confidence. | 
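
The "chosen level and above" semantics can be illustrated with an ordered list of confidence levels. The level names below are assumptions derived from the enum value names, ordered as the enum lists them; how `PHISH_BLOCK_THRESHOLD_UNSPECIFIED` behaves is not described here, so the sketch leaves it out.

```python
# Assumed ordering, lowest to highest, inferred from the enum value names.
CONFIDENCE_ORDER = ["LOW", "MEDIUM", "HIGH", "HIGHER", "VERY_HIGH", "EXTREMELY_HIGH"]

# Minimum confidence at which each threshold starts blocking.
THRESHOLD_MIN_CONFIDENCE = {
    "BLOCK_LOW_AND_ABOVE": "LOW",
    "BLOCK_MEDIUM_AND_ABOVE": "MEDIUM",
    "BLOCK_HIGH_AND_ABOVE": "HIGH",
    "BLOCK_HIGHER_AND_ABOVE": "HIGHER",
    "BLOCK_VERY_HIGH_AND_ABOVE": "VERY_HIGH",
    "BLOCK_ONLY_EXTREMELY_HIGH": "EXTREMELY_HIGH",
}


def is_blocked(url_confidence: str, threshold: str) -> bool:
    """Return True if a URL flagged at url_confidence is blocked by threshold."""
    minimum = THRESHOLD_MIN_CONFIDENCE[threshold]
    return CONFIDENCE_ORDER.index(url_confidence) >= CONFIDENCE_ORDER.index(minimum)


blocked = is_blocked("VERY_HIGH", "BLOCK_HIGH_AND_ABOVE")
```

A lower threshold blocks more URLs: `BLOCK_LOW_AND_ABOVE` blocks every flagged URL in this sketch, while `BLOCK_ONLY_EXTREMELY_HIGH` blocks only the top level.
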
ToolCall
Spec for tool call.
| Fields | |
|---|---|
| tool_name | Required. Spec for tool name. |
| tool_input | Optional. Spec for tool input. |
ToolCallValidInput
Input for tool call valid metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for tool call valid metric. | 
| instances[] | Required. Repeated tool call valid instances. | 
ToolCallValidInstance
Spec for tool call valid instance.
| Fields | |
|---|---|
| prediction | Required. Output of the evaluated model. |
| reference | Required. Ground truth used to compare against the prediction. |
ToolCallValidMetricValue
Tool call valid metric value for an instance.
| Fields | |
|---|---|
| score | Output only. Tool call valid score. |
ToolCallValidResults
Results for tool call valid metric.
| Fields | |
|---|---|
| tool_call_valid_metric_values[] | Output only. Tool call valid metric values. | 
ToolCallValidSpec
This type has no fields.
Spec for tool call valid metric.
ToolConfig
Tool config. This config is shared for all tools provided in the request.
| Fields | |
|---|---|
| function_calling_config | Optional. Function calling config. | 
| retrieval_config | Optional. Retrieval config. | 
ToolNameMatchInput
Input for tool name match metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for tool name match metric. | 
| instances[] | Required. Repeated tool name match instances. | 
ToolNameMatchInstance
Spec for tool name match instance.
| Fields | |
|---|---|
| prediction | Required. Output of the evaluated model. |
| reference | Required. Ground truth used to compare against the prediction. |
ToolNameMatchMetricValue
Tool name match metric value for an instance.
| Fields | |
|---|---|
| score | Output only. Tool name match score. |
ToolNameMatchResults
Results for tool name match metric.
| Fields | |
|---|---|
| tool_name_match_metric_values[] | Output only. Tool name match metric values. | 
ToolNameMatchSpec
This type has no fields.
Spec for tool name match metric.
ToolParameterKVMatchInput
Input for tool parameter key value match metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for tool parameter key value match metric. | 
| instances[] | Required. Repeated tool parameter key value match instances. | 
ToolParameterKVMatchInstance
Spec for tool parameter key value match instance.
| Fields | |
|---|---|
| prediction | Required. Output of the evaluated model. |
| reference | Required. Ground truth used to compare against the prediction. |
ToolParameterKVMatchMetricValue
Tool parameter key value match metric value for an instance.
| Fields | |
|---|---|
| score | Output only. Tool parameter key value match score. |
ToolParameterKVMatchResults
Results for tool parameter key value match metric.
| Fields | |
|---|---|
| tool_parameter_kv_match_metric_values[] | Output only. Tool parameter key value match metric values. |
ToolParameterKVMatchSpec
Spec for tool parameter key value match metric.
| Fields | |
|---|---|
| use_strict_string_match | Optional. Whether to use STRICT string match on parameter values. |
ToolParameterKeyMatchInput
Input for tool parameter key match metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for tool parameter key match metric. | 
| instances[] | Required. Repeated tool parameter key match instances. | 
ToolParameterKeyMatchInstance
Spec for tool parameter key match instance.
| Fields | |
|---|---|
| prediction | Required. Output of the evaluated model. |
| reference | Required. Ground truth used to compare against the prediction. |
ToolParameterKeyMatchMetricValue
Tool parameter key match metric value for an instance.
| Fields | |
|---|---|
| score | Output only. Tool parameter key match score. |
ToolParameterKeyMatchResults
Results for tool parameter key match metric.
| Fields | |
|---|---|
| tool_parameter_key_match_metric_values[] | Output only. Tool parameter key match metric values. |
ToolParameterKeyMatchSpec
This type has no fields.
Spec for tool parameter key match metric.
ToolUseExample
A single example of the tool usage.
| Fields | |
|---|---|
| display_name | Required. The display name for example. |
| query | Required. Query that should be routed to this tool. |
| request_params | Request parameters used for executing this tool. | 
| response_params | Response parameters generated by this tool. | 
| response_summary | Summary of the tool response to the user query. |
| Union field Target. Target tool to use. Target can be only one of the following: | |
| extension_operation | Extension operation to call. |
| function_name | Function name to call. |
ExtensionOperation
Identifies one operation of the extension.
| Fields | |
|---|---|
| extension | Resource name of the extension. |
| operation_id | Required. Operation ID of the extension. |
Trajectory
Spec for trajectory.
| Fields | |
|---|---|
| tool_calls[] | Required. Tool calls in the trajectory. | 
TrajectoryAnyOrderMatchInput
Instances and metric spec for TrajectoryAnyOrderMatch metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for TrajectoryAnyOrderMatch metric. | 
| instances[] | Required. Repeated TrajectoryAnyOrderMatch instance. |
TrajectoryAnyOrderMatchInstance
Spec for TrajectoryAnyOrderMatch instance.
| Fields | |
|---|---|
| predicted_trajectory | Required. Spec for predicted tool call trajectory. | 
| reference_trajectory | Required. Spec for reference tool call trajectory. | 
TrajectoryAnyOrderMatchMetricValue
TrajectoryAnyOrderMatch metric value for an instance.
| Fields | |
|---|---|
| score | Output only. TrajectoryAnyOrderMatch score. |
TrajectoryAnyOrderMatchResults
Results for TrajectoryAnyOrderMatch metric.
| Fields | |
|---|---|
| trajectory_any_order_match_metric_values[] | Output only. TrajectoryAnyOrderMatch metric values. |
TrajectoryAnyOrderMatchSpec
This type has no fields.
Spec for TrajectoryAnyOrderMatch metric - returns 1 if all tool calls in the reference trajectory appear in the predicted trajectory in any order, else 0.
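
The any-order rule can be sketched as a multiset containment check. This is an illustration under stated assumptions, not the service's implementation: tool calls are modeled as hypothetical `(tool_name, tool_input)` tuples, and a call repeated in the reference is assumed to need the same number of repeats in the prediction.

```python
from collections import Counter
from typing import List, Tuple

ToolCall = Tuple[str, str]  # (tool_name, tool_input); hypothetical stand-in for ToolCall


def trajectory_any_order_match(predicted: List[ToolCall], reference: List[ToolCall]) -> int:
    """Return 1 if every reference call occurs in predicted, ignoring order."""
    predicted_counts = Counter(predicted)
    reference_counts = Counter(reference)
    # Assumption: repeated reference calls must be matched at least as often.
    covered = all(predicted_counts[call] >= n for call, n in reference_counts.items())
    return 1 if covered else 0


reference = [("search", '{"q": "weather"}'), ("book", '{"id": 7}')]
predicted = [("book", '{"id": 7}'), ("log", "{}"), ("search", '{"q": "weather"}')]
score = trajectory_any_order_match(predicted, reference)
```

Extra calls in the prediction, such as `log` here, do not lower the score in this sketch.
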
TrajectoryExactMatchInput
Instances and metric spec for TrajectoryExactMatch metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for TrajectoryExactMatch metric. | 
| instances[] | Required. Repeated TrajectoryExactMatch instance. | 
TrajectoryExactMatchInstance
Spec for TrajectoryExactMatch instance.
| Fields | |
|---|---|
| predicted_trajectory | Required. Spec for predicted tool call trajectory. | 
| reference_trajectory | Required. Spec for reference tool call trajectory. | 
TrajectoryExactMatchMetricValue
TrajectoryExactMatch metric value for an instance.
| Fields | |
|---|---|
| score | Output only. TrajectoryExactMatch score. |
TrajectoryExactMatchResults
Results for TrajectoryExactMatch metric.
| Fields | |
|---|---|
| trajectory_exact_match_metric_values[] | Output only. TrajectoryExactMatch metric values. |
TrajectoryExactMatchSpec
This type has no fields.
Spec for TrajectoryExactMatch metric - returns 1 if tool calls in the reference trajectory exactly match the predicted trajectory, else 0.
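
A minimal sketch of the exact-match rule, assuming tool calls are compared as hypothetical `(tool_name, tool_input)` tuples and that "exactly match" means the same calls in the same order with nothing extra:

```python
from typing import List, Tuple

ToolCall = Tuple[str, str]  # (tool_name, tool_input); hypothetical stand-in for ToolCall


def trajectory_exact_match(predicted: List[ToolCall], reference: List[ToolCall]) -> int:
    """Return 1 if both trajectories hold the same calls in the same order."""
    return 1 if list(predicted) == list(reference) else 0


reference = [("search", '{"q": "weather"}'), ("book", '{"id": 7}')]
same = trajectory_exact_match(list(reference), reference)
reordered = trajectory_exact_match(list(reversed(reference)), reference)
```
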
TrajectoryInOrderMatchInput
Instances and metric spec for TrajectoryInOrderMatch metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for TrajectoryInOrderMatch metric. | 
| instances[] | Required. Repeated TrajectoryInOrderMatch instance. |
TrajectoryInOrderMatchInstance
Spec for TrajectoryInOrderMatch instance.
| Fields | |
|---|---|
| predicted_trajectory | Required. Spec for predicted tool call trajectory. | 
| reference_trajectory | Required. Spec for reference tool call trajectory. | 
TrajectoryInOrderMatchMetricValue
TrajectoryInOrderMatch metric value for an instance.
| Fields | |
|---|---|
| score | Output only. TrajectoryInOrderMatch score. |
TrajectoryInOrderMatchResults
Results for TrajectoryInOrderMatch metric.
| Fields | |
|---|---|
| trajectory_in_order_match_metric_values[] | Output only. TrajectoryInOrderMatch metric values. |
TrajectoryInOrderMatchSpec
This type has no fields.
Spec for TrajectoryInOrderMatch metric - returns 1 if tool calls in the reference trajectory appear in the predicted trajectory in the same order, else 0.
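
The in-order rule amounts to a subsequence test: the reference calls must appear in the prediction in the same relative order, with other calls allowed in between. The sketch below assumes hypothetical `(tool_name, tool_input)` tuples and is not the service's implementation.

```python
from typing import List, Tuple

ToolCall = Tuple[str, str]  # (tool_name, tool_input); hypothetical stand-in for ToolCall


def trajectory_in_order_match(predicted: List[ToolCall], reference: List[ToolCall]) -> int:
    """Return 1 if reference is a subsequence of predicted."""
    remaining = iter(predicted)
    # `call in remaining` consumes the iterator up to the first match, so each
    # reference call must be found after the previous one.
    return 1 if all(call in remaining for call in reference) else 0


reference = [("search", "{}"), ("book", "{}")]
in_order = trajectory_in_order_match([("search", "{}"), ("log", "{}"), ("book", "{}")], reference)
out_of_order = trajectory_in_order_match([("book", "{}"), ("search", "{}")], reference)
```
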
TrajectoryPrecisionInput
Instances and metric spec for TrajectoryPrecision metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for TrajectoryPrecision metric. | 
| instances[] | Required. Repeated TrajectoryPrecision instance. | 
TrajectoryPrecisionInstance
Spec for TrajectoryPrecision instance.
| Fields | |
|---|---|
| predicted_trajectory | Required. Spec for predicted tool call trajectory. | 
| reference_trajectory | Required. Spec for reference tool call trajectory. | 
TrajectoryPrecisionMetricValue
TrajectoryPrecision metric value for an instance.
| Fields | |
|---|---|
| score | Output only. TrajectoryPrecision score. |
TrajectoryPrecisionResults
Results for TrajectoryPrecision metric.
| Fields | |
|---|---|
| trajectory_precision_metric_values[] | Output only. TrajectoryPrecision metric values. |
TrajectoryPrecisionSpec
This type has no fields.
Spec for TrajectoryPrecision metric - returns a float score based on average precision of individual tool calls.
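
One plausible reading of this metric is the fraction of predicted tool calls that also appear in the reference trajectory. The sketch below follows that reading; the tuple representation, the handling of an empty prediction, and the exact averaging are assumptions rather than documented behavior.

```python
from typing import List, Tuple

ToolCall = Tuple[str, str]  # (tool_name, tool_input); hypothetical stand-in for ToolCall


def trajectory_precision(predicted: List[ToolCall], reference: List[ToolCall]) -> float:
    """Fraction of predicted calls that are present in the reference."""
    if not predicted:
        return 0.0  # Assumption: an empty prediction scores 0.
    reference_calls = set(reference)
    hits = sum(1 for call in predicted if call in reference_calls)
    return hits / len(predicted)


reference = [("search", "{}"), ("book", "{}")]
predicted = [("search", "{}"), ("log", "{}"), ("book", "{}")]
score = trajectory_precision(predicted, reference)  # 2 of 3 predicted calls are relevant
```
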
TrajectoryRecallInput
Instances and metric spec for TrajectoryRecall metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for TrajectoryRecall metric. | 
| instances[] | Required. Repeated TrajectoryRecall instance. | 
TrajectoryRecallInstance
Spec for TrajectoryRecall instance.
| Fields | |
|---|---|
| predicted_trajectory | Required. Spec for predicted tool call trajectory. | 
| reference_trajectory | Required. Spec for reference tool call trajectory. | 
TrajectoryRecallMetricValue
TrajectoryRecall metric value for an instance.
| Fields | |
|---|---|
| score | Output only. TrajectoryRecall score. |
TrajectoryRecallResults
Results for TrajectoryRecall metric.
| Fields | |
|---|---|
| trajectory_recall_metric_values[] | Output only. TrajectoryRecall metric values. | 
TrajectoryRecallSpec
This type has no fields.
Spec for TrajectoryRecall metric - returns a float score based on average recall of individual tool calls.
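
One plausible reading of this metric is the fraction of reference tool calls that the predicted trajectory contains. The sketch below follows that reading; the tuple representation, the handling of an empty reference, and the exact averaging are assumptions rather than documented behavior.

```python
from typing import List, Tuple

ToolCall = Tuple[str, str]  # (tool_name, tool_input); hypothetical stand-in for ToolCall


def trajectory_recall(predicted: List[ToolCall], reference: List[ToolCall]) -> float:
    """Fraction of reference calls that are present in the prediction."""
    if not reference:
        return 0.0  # Assumption: an empty reference scores 0.
    predicted_calls = set(predicted)
    hits = sum(1 for call in reference if call in predicted_calls)
    return hits / len(reference)


reference = [("search", "{}"), ("book", "{}")]
predicted = [("search", "{}"), ("log", "{}")]
score = trajectory_recall(predicted, reference)  # 1 of 2 reference calls was made
```
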
TrajectorySingleToolUseInput
Instances and metric spec for TrajectorySingleToolUse metric.
| Fields | |
|---|---|
| metric_spec | Required. Spec for TrajectorySingleToolUse metric. | 
| instances[] | Required. Repeated TrajectorySingleToolUse instance. |
TrajectorySingleToolUseInstance
Spec for TrajectorySingleToolUse instance.
| Fields | |
|---|---|
| predicted_trajectory | Required. Spec for predicted tool call trajectory. | 
TrajectorySingleToolUseMetricValue
TrajectorySingleToolUse metric value for an instance.
| Fields | |
|---|---|
| score | Output only. TrajectorySingleToolUse score. |
TrajectorySingleToolUseResults
Results for TrajectorySingleToolUse metric.
| Fields | |
|---|---|
| trajectory_single_tool_use_metric_values[] | Output only. TrajectorySingleToolUse metric values. |
TrajectorySingleToolUseSpec
Spec for TrajectorySingleToolUse metric - returns 1 if tool is present in the predicted trajectory, else 0.
| Fields | |
|---|---|
| tool_name | Required. Spec for tool name to be checked for in the predicted trajectory. |
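
The single-tool-use rule only looks at tool names in the predicted trajectory. A minimal sketch, assuming tool calls are modeled as hypothetical `(tool_name, tool_input)` tuples:

```python
from typing import List, Tuple

ToolCall = Tuple[str, str]  # (tool_name, tool_input); hypothetical stand-in for ToolCall


def trajectory_single_tool_use(predicted: List[ToolCall], tool_name: str) -> int:
    """Return 1 if any predicted call uses tool_name, else 0."""
    return 1 if any(name == tool_name for name, _ in predicted) else 0


predicted = [("search", '{"q": "flights"}'), ("book", '{"id": 7}')]
used_book = trajectory_single_tool_use(predicted, "book")
used_pay = trajectory_single_tool_use(predicted, "pay")
```
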
TunedModel
The Model Registry Model and Online Prediction Endpoint associated with this TuningJob.
| Fields | |
|---|---|
| model | 
 Output only. The resource name of the TunedModel. Format: 
 When tuning from a base model, the version ID will be 1. For continuous tuning, if the provided tuned_model_display_name is set and different from parent model's display name, the tuned model will have a new parent model with version 1. Otherwise the version id will be incremented by 1 from the last version ID in the parent model. E.g., 
 | 
| endpoint | Output only. A resource name of an Endpoint. Format:  |
| checkpoints[] | Output only. The checkpoints associated with this TunedModel. This field is only populated for tuning jobs that enable intermediate checkpoints. | 
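
The version ID rule described for the `model` field can be summarized as a small decision function. This is a sketch of the stated rule only; the function name, the boolean flag, and the use of plain integers for version IDs are assumptions made for illustration.

```python
from typing import Optional


def tuned_model_version_id(
    is_continuous_tuning: bool,
    tuned_model_display_name: Optional[str],
    parent_display_name: Optional[str],
    last_parent_version: int,
) -> int:
    """Version ID the tuned model would receive under the documented rule."""
    if not is_continuous_tuning:
        return 1  # Tuning from a base model always yields version 1.
    if tuned_model_display_name and tuned_model_display_name != parent_display_name:
        return 1  # A new parent model is created, starting at version 1.
    return last_parent_version + 1  # Otherwise the parent's version is incremented.


from_base = tuned_model_version_id(False, None, None, 0)
same_name = tuned_model_version_id(True, "support-bot", "support-bot", 3)
new_name = tuned_model_version_id(True, "support-bot-v2", "support-bot", 3)
```
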
TunedModelCheckpoint
TunedModelCheckpoint for the Tuned Model of a Tuning Job.
| Fields | |
|---|---|
| checkpoint_id | The ID of the checkpoint. |
| epoch | The epoch of the checkpoint. |
| step | The step of the checkpoint. |
| endpoint | The Endpoint resource name that the checkpoint is deployed to. Format:  |
TunedModelRef
TunedModel Reference for legacy model migration.
| Fields | |
|---|---|
| Union field tuned_model_ref. The Tuned Model Reference for the model. tuned_model_ref can be only one of the following: | |
| tuned_model | Support migration from model registry. |
| tuning_job | Support migration from tuning job list page, from gemini-1.0-pro-002 to 1.5 and above. |
| pipeline_job | Support migration from tuning job list page, from bison model to gemini model. |
TuningDataStats
The tuning data statistic values for TuningJob.
| Fields | |
|---|---|
| Union field  | |
| supervised_tuning_data_stats | The SFT Tuning data stats. | 
| distillation_data_stats | Output only. Statistics for distillation. | 
| preference_optimization_data_stats | Output only. Statistics for preference optimization. |
TuningJob
Represents a TuningJob that runs with Google owned models.
| Fields | |
|---|---|
| name | Output only. Identifier. Resource name of a TuningJob. Format:  |
| tuned_model_display_name | Optional. The display name of the  |
| description | Optional. The description of the  |
| custom_base_model | Optional. The user-provided path to custom model weights. Set this field to tune a custom model. The path must be a Cloud Storage directory that contains the model weights in .safetensors format along with associated model metadata files. If this field is set, the base_model field must still be set to indicate which base model the custom model is derived from. This feature is only available for open source models. |
| state | Output only. The detailed state of the job. | 
| tuning_job_state | Output only. The detail state of the tuning job (while the overall  | 
| create_time | Output only. Time when the  | 
| start_time | Output only. Time when the  | 
| end_time | Output only. Time when the TuningJob entered any of the following  | 
| update_time | Output only. Time when the  | 
| error | Output only. Only populated when job's state is  | 
| labels | Optional. The labels with user-defined metadata to organize  Label keys and values can be no longer than 64 characters (Unicode codepoints), can only contain lowercase letters, numeric characters, underscores and dashes. International characters are allowed. See https://goo.gl/xmQnxf for more information and examples of labels. |
| experiment | Output only. The Experiment associated with this  |
| tuned_model | Output only. The tuned model resources associated with this  | 
| tuning_data_stats | Output only. The tuning data statistics associated with this  | 
| pipeline_job | Output only. The resource name of the PipelineJob associated with the  |
| encryption_spec | Customer-managed encryption key options for a TuningJob. If this is set, then all resources created by the TuningJob will be encrypted with the provided encryption key. | 
| service_account | The service account that the tuningJob workload runs as. If not specified, the Vertex AI Secure Fine-Tuned Service Agent in the project will be used. See https://cloud.google.com/iam/docs/service-agents#vertex-ai-secure-fine-tuning-service-agent Users starting the pipeline must have the  |
| output_uri | Optional. Cloud Storage path to the directory where tuning job outputs are written to. This field is only available and required for open source models. |
| evaluate_dataset_runs[] | Output only. Evaluation runs for the Tuning Job. | 
| satisfies_pzs | Output only. Reserved for future use. |
| satisfies_pzi | Output only. Reserved for future use. |
| Union field  | |
| base_model | The base model that is being tuned. See Supported models. |
| pre_tuned_model | The pre-tuned model for continuous tuning. | 
| Union field  | |
| supervised_tuning_spec | Tuning Spec for Supervised Fine Tuning. | 
| distillation_spec | Tuning Spec for Distillation. | 
| partner_model_tuning_spec | Tuning Spec for open sourced and third party Partner models. | 
| preference_optimization_spec | Tuning Spec for Preference Optimization. | 
| veo_tuning_spec | Tuning Spec for Veo Tuning. | 
| full_fine_tuning_spec | Tuning Spec for Full Fine Tuning. | 
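
The constraints stated for the `labels` field (at most 64 Unicode codepoints; lowercase letters, numeric characters, underscores and dashes; international characters allowed) can be checked before a TuningJob is submitted. The sketch below encodes only those stated rules; the helper names are hypothetical, and any further requirements from the linked labels documentation are not covered.

```python
def is_valid_label_part(text: str) -> bool:
    """Check one label key or value against the stated constraints."""
    if len(text) > 64:  # len() counts Unicode codepoints in Python 3.
        return False
    for ch in text:
        if ch in "_-" or ch.isdigit():
            continue
        # Letters are accepted unless they are uppercase, so caseless
        # international characters pass as well.
        if ch.isalpha() and not ch.isupper():
            continue
        return False
    return True


def invalid_labels(labels: dict) -> dict:
    """Return the subset of labels whose key or value breaks a rule."""
    return {
        key: value
        for key, value in labels.items()
        if not (is_valid_label_part(key) and is_valid_label_part(value))
    }


problems = invalid_labels({"team": "search-infra", "Owner": "alice", "env": "prod 1"})
```
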
TuningJobState
Represents the detailed state of the tuning job while the overall JobState is running.
| Enums | |
|---|---|
| TUNING_JOB_STATE_UNSPECIFIED | Default tuning job state. | 
| TUNING_JOB_STATE_WAITING_FOR_QUOTA | Tuning job is waiting for job quota. | 
| TUNING_JOB_STATE_PROCESSING_DATASET | Tuning job is validating the dataset. | 
| TUNING_JOB_STATE_WAITING_FOR_CAPACITY | Tuning job is waiting for hardware capacity. | 
| TUNING_JOB_STATE_TUNING | Tuning job is running. | 
| TUNING_JOB_STATE_POST_PROCESSING | Tuning job is doing some post processing steps. | 
Type
Type contains the list of OpenAPI data types as defined by https://swagger.io/docs/specification/data-models/data-types/
| Enums | |
|---|---|
| TYPE_UNSPECIFIED | Not specified, should not be used. | 
| STRING | OpenAPI string type | 
| NUMBER | OpenAPI number type | 
| INTEGER | OpenAPI integer type | 
| BOOLEAN | OpenAPI boolean type | 
| ARRAY | OpenAPI array type | 
| OBJECT | OpenAPI object type | 
| NULL | Null type | 
UpdateCacheConfigRequest
Request message for updating a cache config.
| Fields | |
|---|---|
| cache_config | Required. The cache config to be updated.  | 
UpdateCachedContentRequest
Request message for GenAiCacheService.UpdateCachedContent. Only expire_time or ttl can be updated.
| Fields | |
|---|---|
| cached_content | Required. The cached content to update | 
| update_mask | Required. The list of fields to update. | 
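
Because only `expire_time` or `ttl` can be updated, a client can reject other mask paths before sending the request. A minimal sketch, assuming the mask is handled as a plain list of path strings; the helper name is hypothetical, and how the service responds to other paths is not described here.

```python
UPDATABLE_FIELDS = {"expire_time", "ttl"}


def check_cached_content_update_mask(paths: list) -> list:
    """Return the mask paths unchanged, or raise if any path is not updatable."""
    rejected = [path for path in paths if path not in UPDATABLE_FIELDS]
    if rejected:
        raise ValueError(f"only expire_time or ttl can be updated, got {rejected}")
    return list(paths)


mask = check_cached_content_update_mask(["ttl"])
```
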
UpdateEvaluationSetRequest
Request message for EvaluationManagementService.UpdateEvaluationSet.
| Fields | |
|---|---|
| evaluation_set | Required. The EvaluationSet to update. The EvaluationSet's  | 
| update_mask | Optional. The update mask applies to the resource. For the  | 
UpdateExtensionRequest
Request message for ExtensionRegistryService.UpdateExtension.
| Fields | |
|---|---|
| extension | Required. The Extension which replaces the resource on the server. | 
| update_mask | Required. Mask specifying which fields to update. Supported fields:  |
UpdateMemoryOperationMetadata
Details of MemoryBankService.UpdateMemory operation.
| Fields | |
|---|---|
| generic_metadata | The common part of the operation metadata. | 
UpdateMemoryRequest
Request message for MemoryBankService.UpdateMemory.
| Fields | |
|---|---|
| memory | Required. The Memory which replaces the resource on the server. | 
| update_mask | Optional. Mask specifying which fields to update. Supported fields:  |
UpdateRagCorpusOperationMetadata
Runtime operation information for VertexRagDataService.UpdateRagCorpus.
| Fields | |
|---|---|
| generic_metadata | The operation generic information. | 
UpdateRagCorpusRequest
Request message for VertexRagDataService.UpdateRagCorpus.
| Fields | |
|---|---|
| rag_corpus | Required. The RagCorpus which replaces the resource on the server. | 
UpdateRagEngineConfigOperationMetadata
Runtime operation information for VertexRagDataService.UpdateRagEngineConfig.
| Fields | |
|---|---|
| generic_metadata | The operation generic information. | 
UpdateRagEngineConfigRequest
Request message for VertexRagDataService.UpdateRagEngineConfig.
| Fields | |
|---|---|
| rag_engine_config | Required. The updated RagEngineConfig. NOTE: Downgrading your RagManagedDb's ComputeTier could temporarily increase request latencies until the operation is fully complete. | 
UpdateReasoningEngineOperationMetadata
Details of ReasoningEngineService.UpdateReasoningEngine operation.
| Fields | |
|---|---|
| generic_metadata | The common part of the operation metadata. | 
UpdateReasoningEngineRequest
Request message for ReasoningEngineService.UpdateReasoningEngine.
| Fields | |
|---|---|
| reasoning_engine | Required. The ReasoningEngine which replaces the resource on the server. | 
| update_mask | Optional. Mask specifying which fields to update. | 
UpdateSessionRequest
Request message for SessionService.UpdateSession.
| Fields | |
|---|---|
| session | Required. The session to update. Format:  | 
| update_mask | Optional. Field mask is used to control which fields get updated. If the mask is not present, all fields will be updated. | 
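
The effect of `update_mask` can be illustrated on dict-shaped resources: listed paths are taken from the incoming session, and an absent mask updates every field. This is a sketch of common field-mask semantics limited to top-level fields; the function name and the example field names are assumptions, and clearing a masked field that is missing from the incoming resource is likewise assumed rather than documented here.

```python
from typing import Optional


def apply_update(current: dict, incoming: dict, mask_paths: Optional[list] = None) -> dict:
    """Merge incoming into current, restricted to mask_paths when given."""
    if mask_paths:
        paths = list(mask_paths)
    else:
        # No mask: every field of either resource is subject to the update.
        paths = sorted(set(current) | set(incoming))
    updated = dict(current)
    for path in paths:
        if path in incoming:
            updated[path] = incoming[path]
        else:
            updated.pop(path, None)  # Assumption: a masked but unset field is cleared.
    return updated


current = {"display_name": "old", "session_state": {"step": 1}}
incoming = {"display_name": "new", "session_state": {"step": 2}}
masked = apply_update(current, incoming, ["display_name"])
unmasked = apply_update(current, incoming)
```
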
UploadRagFileConfig
Config for uploading RagFile.
| Fields | |
|---|---|
| rag_file_chunking_config | Specifies the size and overlap of chunks after uploading RagFile. | 
| rag_file_transformation_config | Specifies the transformation config for RagFiles. | 
| rag_file_metadata_config | Specifies the metadata config for RagFiles, including paths for the metadata schema and metadata. Alternatively, an inline metadata schema and metadata can be provided. | 
| rag_file_parsing_config | Optional. Specifies the parsing config for RagFiles. RAG will use the default parser if this field is not set. | 
UrlContext
This type has no fields.
Tool to support URL context.
UrlContextMetadata
Metadata related to url context retrieval tool.
| Fields | |
|---|---|
| url_metadata[] | Output only. List of url context. | 
UrlMetadata
Context of a single URL retrieval.
| Fields | |
|---|---|
| retrieved_url | Retrieved url by the tool. |
| url_retrieval_status | Status of the url retrieval. | 
UrlRetrievalStatus
Status of the url retrieval.
| Enums | |
|---|---|
| URL_RETRIEVAL_STATUS_UNSPECIFIED | Default value. This value is unused. | 
| URL_RETRIEVAL_STATUS_SUCCESS | Url retrieval is successful. | 
| URL_RETRIEVAL_STATUS_ERROR | Url retrieval failed due to an error. | 
UsageMetadata
Usage metadata about the content generation request and response. This message provides a detailed breakdown of token usage and other relevant metrics.
| Fields | |
|---|---|
| prompt_token_count | The total number of tokens in the prompt. This includes any text, images, or other media provided in the request. When  |
| candidates_token_count | The total number of tokens in the generated candidates. |
| total_token_count | The total number of tokens for the entire request. This is the sum of  |
| tool_use_prompt_token_count | Output only. The number of tokens in the results from tool executions, which are provided back to the model as input, if applicable. |
| thoughts_token_count | Output only. The number of tokens that were part of the model's generated "thoughts" output, if applicable. |
| cached_content_token_count | Output only. The number of tokens in the cached content that was used for this request. |
| prompt_tokens_details[] | Output only. A detailed breakdown of the token count for each modality in the prompt. | 
| cache_tokens_details[] | Output only. A detailed breakdown of the token count for each modality in the cached content. | 
| candidates_tokens_details[] | Output only. A detailed breakdown of the token count for each modality in the generated candidates. | 
| tool_use_prompt_tokens_details[] | Output only. A detailed breakdown by modality of the token counts from the results of tool executions, which are provided back to the model as input. | 
| traffic_type | Output only. The traffic type for this request. | 
TrafficType
The type of traffic that this request was processed with, indicating which quota gets consumed.
| Enums | |
|---|---|
| TRAFFIC_TYPE_UNSPECIFIED | Unspecified request traffic type. | 
| ON_DEMAND | Type for Pay-As-You-Go traffic. | 
| PROVISIONED_THROUGHPUT | Type for Provisioned Throughput traffic. | 
VeoHyperParameters
Hyperparameters for Veo.
| Fields | |
|---|---|
| epoch_count | Optional. Number of complete passes the model makes over the entire training dataset during training. |
| learning_rate_multiplier | Optional. Multiplier for adjusting the default learning rate. |
| tuning_task | Optional. The tuning task. Either I2V or T2V. | 
TuningTask
An enum defining the tuning task used for Veo.
| Enums | |
|---|---|
| TUNING_TASK_UNSPECIFIED | Default value. This value is unused. | 
| TUNING_TASK_I2V | Tuning task for image to video. | 
| TUNING_TASK_T2V | Tuning task for text to video. | 
| TUNING_TASK_R2V | Tuning task for reference to video. | 
VeoTuningSpec
Tuning Spec for Veo Model Tuning.
| Fields | |
|---|---|
| training_dataset_uri | Required. Training dataset used for tuning. The dataset can be specified as either a Cloud Storage path to a JSONL file or as the resource name of a Vertex Multimodal Dataset. |
| validation_dataset_uri | Optional. Validation dataset used for tuning. The dataset can be specified as either a Cloud Storage path to a JSONL file or as the resource name of a Vertex Multimodal Dataset. |
| hyper_parameters | Optional. Hyperparameters for Veo. | 
VertexAISearch
Retrieve from Vertex AI Search datastore or engine for grounding. datastore and engine are mutually exclusive. See https://cloud.google.com/products/agent-builder
| Fields | |
|---|---|
| datastore | Optional. Fully-qualified Vertex AI Search data store resource ID. Format:  |
| engine | Optional. Fully-qualified Vertex AI Search engine resource ID. Format:  |
| max_results | Optional. Number of search results to return per query. The default value is 10. The maximum allowed value is 10. |
| filter | Optional. Filter strings to be passed to the search API. |
| data_store_specs[] | Specifications that define the specific DataStores to be searched, along with configurations for those data stores. This is only considered for Engines with multiple data stores. It should only be set if engine is used. | 
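
The constraints stated for this message (`datastore` and `engine` are mutually exclusive, `max_results` is at most 10, and `data_store_specs` should only accompany `engine`) can be checked on a dict-shaped config before it is sent. A minimal sketch; the function name and the resource strings in the example are hypothetical placeholders.

```python
def vertex_ai_search_problems(config: dict) -> list:
    """Return human-readable problems found in a dict-shaped VertexAISearch."""
    problems = []
    if config.get("datastore") and config.get("engine"):
        problems.append("datastore and engine are mutually exclusive")
    if config.get("max_results", 10) > 10:  # 10 is both the default and the maximum.
        problems.append("max_results must not exceed 10")
    if config.get("data_store_specs") and not config.get("engine"):
        problems.append("data_store_specs should only be set when engine is used")
    return problems


# Placeholder resource strings; real values follow the formats given in the table.
ok = vertex_ai_search_problems({"engine": "my-engine", "data_store_specs": [{"data_store": "ds-1"}]})
bad = vertex_ai_search_problems({"datastore": "my-datastore", "engine": "my-engine", "max_results": 25})
```
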
DataStoreSpec
Define data stores within engine to filter on in a search call and configurations for those data stores. For more information, see https://cloud.google.com/generative-ai-app-builder/docs/reference/rpc/google.cloud.discoveryengine.v1#datastorespec
| Fields | |
|---|---|
| data_store | Full resource name of the DataStore. Format:  |
| filter | Optional. Filter specification to filter documents in the data store specified by data_store field. For more information on filtering, see Filtering |
VertexAiSearchConfig
Config for the Vertex AI Search.
| Fields | |
|---|---|
| serving_config | Vertex AI Search Serving Config resource full name. For example,  |
VertexRagStore
Retrieve from Vertex RAG Store for grounding.
| Fields | |
|---|---|
| rag_corpora[] | Optional. Deprecated. Please use rag_resources instead. |
| rag_resources[] | Optional. The representation of the rag source. It can be used to specify a corpus only or specific ragfiles. Currently, only one corpus or multiple files from one corpus are supported. Support for multiple corpora may be added in the future. |
| rag_retrieval_config | Optional. The retrieval config for the Rag query. | 
| store_context | Optional. Currently only supported for Gemini Multimodal Live API. In Gemini Multimodal Live API, if  |
| similarity_top_k | Optional. Number of top k results to return from the selected corpora. |
| vector_distance_threshold | Optional. Only return results with vector distance smaller than the threshold. |
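
The combined effect of `similarity_top_k` and `vector_distance_threshold` can be sketched on a list of scored results. The sketch assumes results are hypothetical `(text, distance)` pairs and that a smaller distance means a closer match; it illustrates the two filters, not the service's retrieval code.

```python
from typing import List, Optional, Tuple


def select_contexts(
    scored: List[Tuple[str, float]],
    similarity_top_k: Optional[int] = None,
    vector_distance_threshold: Optional[float] = None,
) -> List[Tuple[str, float]]:
    """Keep the closest results, bounded by a distance threshold and a top-k cap."""
    ranked = sorted(scored, key=lambda item: item[1])  # closest first
    if vector_distance_threshold is not None:
        # Strictly smaller than the threshold, as the field description states.
        ranked = [item for item in ranked if item[1] < vector_distance_threshold]
    if similarity_top_k is not None:
        ranked = ranked[:similarity_top_k]
    return ranked


results = [("chunk-a", 0.42), ("chunk-b", 0.10), ("chunk-c", 0.75), ("chunk-d", 0.30)]
selected = select_contexts(results, similarity_top_k=2, vector_distance_threshold=0.5)
```

Because the results are sorted by distance first, applying the threshold before or after the top-k cut gives the same selection in this sketch.
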
RagResource
The definition of the Rag resource.
| Fields | |
|---|---|
| rag_corpus | Optional. RagCorpora resource name. Format:  |
| rag_file_ids[] | Optional. rag_file_id. The files should be in the same rag_corpus set in rag_corpus field. |
VideoMetadata
Metadata describes the input video content.
| Fields | |
|---|---|
| start_offset | Optional. The start offset of the video. | 
| end_offset | Optional. The end offset of the video. | 
| fps | Optional. The frame rate of the video sent to the model. If not specified, the default value will be 1.0. The fps range is (0.0, 24.0]. |
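
The `fps` range `(0.0, 24.0]` excludes zero and includes 24.0, and an unset value falls back to 1.0. A minimal sketch of that rule; the function name is hypothetical.

```python
from typing import Optional


def effective_fps(fps: Optional[float] = None) -> float:
    """Return the frame rate to use, applying the default and the range check."""
    if fps is None:
        return 1.0  # Documented default when fps is not specified.
    if not (0.0 < fps <= 24.0):  # Half-open range: 0.0 excluded, 24.0 included.
        raise ValueError(f"fps must be in (0.0, 24.0], got {fps}")
    return float(fps)


default_rate = effective_fps()
max_rate = effective_fps(24)
```
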
VoiceConfig
The configuration for the voice to use.
| Fields | |
|---|---|
| Union field voice_config. The configuration for the speaker to use. voice_config can be only one of the following: | |
| prebuilt_voice_config | The configuration for the prebuilt voice to use. |