Full name: projects.locations.processors.dataset.importDocuments
Import documents into a dataset.
HTTP request
POST https://{endpoint}/v1beta3/{dataset}:importDocuments
Where {endpoint} is one of the supported service endpoints.
Path parameters
| Parameters | |
|---|---|
dataset |
Required. The dataset resource name. Format: projects/{project}/locations/{location}/processors/{processor}/dataset It takes the form |
Request body
The request body contains data with the following structure:
| JSON representation |
|---|
{
"batchDocumentsImportConfigs": [
{
object ( |
| Fields | |
|---|---|
batchDocumentsImportConfigs[] |
Required. The Cloud Storage uri containing raw documents that must be imported. |
Response body
If successful, the response body contains an instance of Operation.
Authorization scopes
Requires the following OAuth scope:
https://www.googleapis.com/auth/cloud-platform
For more information, see the Authentication Overview.
IAM Permissions
Requires the following IAM permission on the dataset resource:
documentai.datasets.createDocuments
For more information, see the IAM documentation.
BatchDocumentsImportConfig
Config for importing documents. Each batch can have its own dataset split type.
| JSON representation |
|---|
{ "batchInputConfig": { object ( |
| Fields | |
|---|---|
batchInputConfig |
The common config to specify a set of documents used as input. |
Union field
|
|
datasetSplit |
Target dataset split where the documents must be stored. |
autoSplitConfig |
If set, documents will be automatically split into training and test split category with the specified ratio. |
AutoSplitConfig
The config for auto-split.
| JSON representation |
|---|
{ "trainingSplitRatio": number } |
| Fields | |
|---|---|
trainingSplitRatio |
Ratio of training dataset split. |