A singleton resource under a Processor which configures a collection of documents.
| JSON representation | 
|---|
| { "name": string, "state": enum ( | 
| Fields | |
|---|---|
| name | 
 Dataset resource name. Format:  | 
| state | 
 Required. State of the dataset. Ignored when updating dataset. | 
| satisfiesPzs | 
 Output only. Reserved for future use. | 
| satisfiesPzi | 
 Output only. Reserved for future use. | 
| Union field  
 | |
| gcsManagedConfig | 
 Optional. User-managed Cloud Storage dataset configuration. Use this configuration if the dataset documents are stored under a user-managed Cloud Storage location. | 
| documentWarehouseConfig | 
 Optional. Deprecated. Warehouse-based dataset configuration is not supported. | 
| unmanagedDatasetConfig | 
 Optional. Unmanaged dataset configuration. Use this configuration if the dataset documents are managed by the document service internally (not user-managed). | 
| Union field  
 | |
| spannerIndexingConfig | 
 Optional. A lightweight indexing source with low latency and high reliability, but lacking advanced features like CMEK and content-based search. | 
GCSManagedConfig
Configuration specific to the Cloud Storage-based implementation.
| JSON representation | 
|---|
| {
  "gcsPrefix": {
    object ( | 
| Fields | |
|---|---|
| gcsPrefix | 
 Required. The Cloud Storage URI (a directory) where the documents belonging to the dataset must be stored. | 
GcsPrefix
Specifies all documents on Cloud Storage with a common prefix.
| JSON representation | 
|---|
| { "gcsUriPrefix": string } | 
| Fields | |
|---|---|
| gcsUriPrefix | 
 The URI prefix. | 
DocumentWarehouseConfig
Configuration specific to the Document AI Warehouse-based implementation.
| JSON representation | 
|---|
| { "collection": string, "schema": string } | 
| Fields | |
|---|---|
| collection | 
 Output only. The collection in Document AI Warehouse associated with the dataset. | 
| schema | 
 Output only. The schema in Document AI Warehouse associated with the dataset. | 
UnmanagedDatasetConfig
This type has no fields.
Configuration specific to an unmanaged dataset.
SpannerIndexingConfig
This type has no fields.
Configuration specific to spanner-based indexing.