DocumentAIProcessorVersion
| Property | Value |
|---|---|
| Google Cloud Service Name | Document AI |
| Google Cloud Service Documentation | /document-ai/docs/ |
| Google Cloud REST Resource Name | projects.locations.processors.processorVersions |
| Google Cloud REST Resource Documentation | /document-ai/docs/reference/rest/v1/projects.locations.processors.processorVersions |
| Config Connector Resource Short Names | gcpdocumentaiprocessorversion gcpdocumentaiprocessorversions documentaiprocessorversion |
| Config Connector Service Name | documentai.googleapis.com |
| Config Connector Resource Fully Qualified Name | documentaiprocessorversions.documentai.cnrm.cloud.google.com |
| Can Be Referenced by IAMPolicy/IAMPolicyMember | No |
| Config Connector Default Average Reconcile Interval In Seconds | 600 |
Custom Resource Definition Properties
Spec
Schema
deprecationInfo:
deprecationTime: string
replacementProcessorVersion: string
displayName: string
kmsKeyNameRef:
external: string
name: string
namespace: string
kmsKeyVersionNameRef:
external: string
processorRef:
external: string
name: string
namespace: string
resourceID: string
| Fields | |
|---|---|
|
Optional |
If set, information about the eventual deprecation of this version. |
|
Optional |
The time at which this processor version will be retired. |
|
Optional |
If set, the processor version that will be used as a replacement. |
|
Optional |
The display name of the processor version. |
|
Optional |
The KMS key name used for encryption. |
|
Optional |
A reference to an externally managed KMSCryptoKey. Should be in the format `projects/[kms_project_id]/locations/[region]/keyRings/[key_ring_id]/cryptoKeys/[key]`. |
|
Optional |
The `name` of a `KMSCryptoKey` resource. |
|
Optional |
The `namespace` of a `KMSCryptoKey` resource. |
|
Optional |
The KMS key version with which data is encrypted. |
|
Optional |
A reference to an externally managed cryptoKeyVersion. Should be in the format `projects/{{kms_project_id}}/locations/{{region}}/keyRings/{{key_ring_id}}/cryptoKeys/{{key}}/cryptoKeyVersions/{{version}}`. |
|
Required |
ProcessorRef defines the resource reference to DocumentAIProcessor, which "External" field holds the Google Cloud identifier for the KRM object. |
|
Optional |
A reference to an externally managed DocumentAIProcessor resource. Should be in the format "projects/{{projectID}}/locations/{{location}}/processors/{{processorID}}". |
|
Optional |
The name of a DocumentAIProcessor resource. |
|
Optional |
The namespace of a DocumentAIProcessor resource. |
|
Optional |
The DocumentAIProcessorVersion name. If not given, the metadata.name will be used. |
Status
Schema
conditions:
- lastTransitionTime: string
message: string
reason: string
status: string
type: string
externalRef: string
observedGeneration: integer
observedState:
create_time: string
document_schema:
description: string
displayName: string
entityTypes:
- baseTypes:
- string
displayName: string
enumValues:
values:
- string
name: string
properties:
- displayName: string
method: string
name: string
occurrenceType: string
valueType: string
metadata:
documentAllowMultipleLabels: boolean
documentSplitter: boolean
prefixedNamingOnProperties: boolean
skipNamingValidation: boolean
gen_ai_model_info:
customGenAiModelInfo:
baseProcessorVersionID: string
customModelType: string
foundationGenAiModelInfo:
finetuningAllowed: boolean
minTrainLabeledDocuments: integer
google_managed: boolean
latest_evaluation:
aggregateMetrics:
f1Score: float
falseNegativesCount: integer
falsePositivesCount: integer
groundTruthDocumentCount: integer
groundTruthOccurrencesCount: integer
precision: float
predictedDocumentCount: integer
predictedOccurrencesCount: integer
recall: float
totalDocumentsCount: integer
truePositivesCount: integer
aggregateMetricsExact:
f1Score: float
falseNegativesCount: integer
falsePositivesCount: integer
groundTruthDocumentCount: integer
groundTruthOccurrencesCount: integer
precision: float
predictedDocumentCount: integer
predictedOccurrencesCount: integer
recall: float
totalDocumentsCount: integer
truePositivesCount: integer
evaluation: string
operation: string
model_type: string
satisfies_pzi: boolean
satisfies_pzs: boolean
state: string
| Fields | |
|---|---|
conditions |
Conditions represent the latest available observations of the object's current state. |
conditions[] |
|
conditions[].lastTransitionTime |
Last time the condition transitioned from one status to another. |
conditions[].message |
Human-readable message indicating details about last transition. |
conditions[].reason |
Unique, one-word, CamelCase reason for the condition's last transition. |
conditions[].status |
Status is the status of the condition. Can be True, False, Unknown. |
conditions[].type |
Type is the type of the condition. |
externalRef |
A unique specifier for the DocumentAI resource in Google Cloud. |
observedGeneration |
ObservedGeneration is the generation of the resource that was most recently observed by the Config Connector controller. If this is equal to metadata.generation, then that means that the current reported status reflects the most recent desired state of the resource. |
observedState |
ObservedState is the state of the resource as most recently observed in Google Cloud. |
observedState.create_time |
The time the processor version was created. |
observedState.document_schema |
The schema of the processor version. Describes the output. |
observedState.document_schema.description |
Description of the schema. |
observedState.document_schema.displayName |
Display name to show to users. |
observedState.document_schema.entityTypes |
Entity types of the schema. |
observedState.document_schema.entityTypes[] |
|
observedState.document_schema.entityTypes[].baseTypes |
The entity type that this type is derived from. For now, one and only one should be set. |
observedState.document_schema.entityTypes[].baseTypes[] |
|
observedState.document_schema.entityTypes[].displayName |
User defined name for the type. |
observedState.document_schema.entityTypes[].enumValues |
If specified, lists all the possible values for this entity. This should not be more than a handful of values. If the number of values is >10 or could change frequently use the `EntityType.value_ontology` field and specify a list of all possible values in a value ontology file. |
observedState.document_schema.entityTypes[].enumValues.values |
The individual values that this enum values type can include. |
observedState.document_schema.entityTypes[].enumValues.values[] |
|
observedState.document_schema.entityTypes[].name |
Name of the type. It must be unique within the schema file and cannot be a "Common Type". The following naming conventions are used:
|
observedState.document_schema.entityTypes[].properties |
Description the nested structure, or composition of an entity. |
observedState.document_schema.entityTypes[].properties[] |
|
observedState.document_schema.entityTypes[].properties[].displayName |
User defined name for the property. |
observedState.document_schema.entityTypes[].properties[].method |
Specifies how the entity's value is obtained. |
observedState.document_schema.entityTypes[].properties[].name |
The name of the property. Follows the same guidelines as the EntityType name. |
observedState.document_schema.entityTypes[].properties[].occurrenceType |
Occurrence type limits the number of instances an entity type appears in the document. |
observedState.document_schema.entityTypes[].properties[].valueType |
A reference to the value type of the property. This type is subject to the same conventions as the `Entity.base_types` field. |
observedState.document_schema.metadata |
Metadata of the schema. |
observedState.document_schema.metadata.documentAllowMultipleLabels |
If true, on a given page, there can be multiple `document` annotations covering it. |
observedState.document_schema.metadata.documentSplitter |
If true, a `document` entity type can be applied to subdocument (splitting). Otherwise, it can only be applied to the entire document (classification). |
observedState.document_schema.metadata.prefixedNamingOnProperties |
If set, all the nested entities must be prefixed with the parents. |
observedState.document_schema.metadata.skipNamingValidation |
If set, skips the naming format validation in the schema, meaning the string values in `DocumentSchema.EntityType.name` and `DocumentSchema.EntityType.Property.name` will not be checked. |
observedState.gen_ai_model_info |
Output only. Information about Generative AI model-based processor versions. |
observedState.gen_ai_model_info.customGenAiModelInfo |
Information for a custom Generative AI model. |
observedState.gen_ai_model_info.customGenAiModelInfo.baseProcessorVersionID |
The base processor version ID for the custom model. |
observedState.gen_ai_model_info.customGenAiModelInfo.customModelType |
The type of custom model created by the user. |
observedState.gen_ai_model_info.foundationGenAiModelInfo |
Information for a pretrained Google-managed foundation model. |
observedState.gen_ai_model_info.foundationGenAiModelInfo.finetuningAllowed |
Whether finetuning is allowed for this base processor version. |
observedState.gen_ai_model_info.foundationGenAiModelInfo.minTrainLabeledDocuments |
The minimum number of labeled documents in the training dataset required for finetuning. |
observedState.google_managed |
Output only. Denotes that this `ProcessorVersion` is managed by Google. |
observedState.latest_evaluation |
The most recently invoked evaluation for the processor version. |
observedState.latest_evaluation.aggregateMetrics |
An aggregate of the statistics for the evaluation with fuzzy matching on. |
observedState.latest_evaluation.aggregateMetrics.f1Score |
The calculated F1 score. |
observedState.latest_evaluation.aggregateMetrics.falseNegativesCount |
The amount of false negatives. |
observedState.latest_evaluation.aggregateMetrics.falsePositivesCount |
The amount of false positives. |
observedState.latest_evaluation.aggregateMetrics.groundTruthDocumentCount |
The amount of documents with a ground truth occurrence. |
observedState.latest_evaluation.aggregateMetrics.groundTruthOccurrencesCount |
The amount of occurrences in ground truth documents. |
observedState.latest_evaluation.aggregateMetrics.precision |
The calculated precision. |
observedState.latest_evaluation.aggregateMetrics.predictedDocumentCount |
The amount of documents with a predicted occurrence. |
observedState.latest_evaluation.aggregateMetrics.predictedOccurrencesCount |
The amount of occurrences in predicted documents. |
observedState.latest_evaluation.aggregateMetrics.recall |
The calculated recall. |
observedState.latest_evaluation.aggregateMetrics.totalDocumentsCount |
The amount of documents that had an occurrence of this label. |
observedState.latest_evaluation.aggregateMetrics.truePositivesCount |
The amount of true positives. |
observedState.latest_evaluation.aggregateMetricsExact |
An aggregate of the statistics for the evaluation with fuzzy matching off. |
observedState.latest_evaluation.aggregateMetricsExact.f1Score |
The calculated f1 score. |
observedState.latest_evaluation.aggregateMetricsExact.falseNegativesCount |
The amount of false negatives. |
observedState.latest_evaluation.aggregateMetricsExact.falsePositivesCount |
The amount of false positives. |
observedState.latest_evaluation.aggregateMetricsExact.groundTruthDocumentCount |
The amount of documents with a ground truth occurrence. |
observedState.latest_evaluation.aggregateMetricsExact.groundTruthOccurrencesCount |
The amount of occurrences in ground truth documents. |
observedState.latest_evaluation.aggregateMetricsExact.precision |
The calculated precision. |
observedState.latest_evaluation.aggregateMetricsExact.predictedDocumentCount |
The amount of documents with a predicted occurrence. |
observedState.latest_evaluation.aggregateMetricsExact.predictedOccurrencesCount |
The amount of occurrences in predicted documents. |
observedState.latest_evaluation.aggregateMetricsExact.recall |
The calculated recall. |
observedState.latest_evaluation.aggregateMetricsExact.totalDocumentsCount |
The amount of documents that had an occurrence of this label. |
observedState.latest_evaluation.aggregateMetricsExact.truePositivesCount |
The amount of true positives. |
observedState.latest_evaluation.evaluation |
The resource name of the evaluation. |
observedState.latest_evaluation.operation |
The resource name of the Long Running Operation for the evaluation. |
observedState.model_type |
Output only. The model type of this processor version. |
observedState.satisfies_pzi |
Output only. Reserved for future use. |
observedState.satisfies_pzs |
Output only. Reserved for future use. |
observedState.state |
Output only. The state of the processor version. |