Class BatchDocumentsImportConfig (3.14.0)

BatchDocumentsImportConfig(mapping=None, *, ignore_unknown_fields=False, **kwargs)

Config for importing documents. Each batch can have its own dataset split type.

This message has oneof_ fields (mutually exclusive fields). For each oneof, at most one member field can be set at the same time. Setting any member of the oneof automatically clears all other members.

.. _oneof: https://proto-plus-python.readthedocs.io/en/stable/fields.html#oneofs-mutually-exclusive-fields

Attributes

Name Description
dataset_split google.cloud.documentai_v1beta3.types.DatasetSplitType
Target dataset split where the documents must be stored. This field is a member of oneof_ split_type_config.
auto_split_config google.cloud.documentai_v1beta3.types.ImportDocumentsRequest.BatchDocumentsImportConfig.AutoSplitConfig
If set, documents will be automatically split into training and test split category with the specified ratio. This field is a member of oneof_ split_type_config.
batch_input_config google.cloud.documentai_v1beta3.types.BatchDocumentsInputConfig
The common config to specify a set of documents used as input.
document_type str
Optional. If set, determines the type of the documents to be imported in this batch. It can be used to auto-label the documents with a single entity of the provided type. This field can only be used with a classifier or splitter processor. Providing this field is mutually exclusive with entities and auto_labeling_config.

Classes

AutoSplitConfig

AutoSplitConfig(mapping=None, *, ignore_unknown_fields=False, **kwargs)

The config for auto-split.