Specifies the parsing config for RagFiles.
useAdvancedPdfParsing
(deprecated)boolean
Whether to use advanced PDF parsing.
parserUnion type
parser can be only one of the following:The Advanced Parser to use for RagFiles.
The Layout Parser to use for RagFiles.
The LLM Parser to use for RagFiles.
| JSON representation |
|---|
{ "useAdvancedPdfParsing": boolean, // parser "advancedParser": { object ( |
AdvancedParser
Specifies the advanced parsing for RagFiles.
useAdvancedPdfParsingboolean
Whether to use advanced PDF parsing.
| JSON representation |
|---|
{ "useAdvancedPdfParsing": boolean } |
LayoutParser
Document AI Layout Parser config.
processorNamestring
The full resource name of a Document AI processor or processor version. The processor must have type LAYOUT_PARSER_PROCESSOR. If specified, the additionalConfig.parse_as_scanned_pdf field must be false. Format: * projects/{projectId}/locations/{location}/processors/{processorId} * projects/{projectId}/locations/{location}/processors/{processorId}/processorVersions/{processor_version_id}
maxParsingRequestsPerMininteger
The maximum number of requests the job is allowed to make to the Document AI processor per minute. Consult https://cloud.google.com/document-ai/quotas and the Quota page for your project to set an appropriate value here. If unspecified, a default value of 120 QPM would be used.
globalMaxParsingRequestsPerMininteger
The maximum number of requests the job is allowed to make to the Document AI processor per minute in this project. Consult https://cloud.google.com/document-ai/quotas and the Quota page for your project to set an appropriate value here. If this value is not specified, maxParsingRequestsPerMin will be used by indexing pipeline as the global limit.
| JSON representation |
|---|
{ "processorName": string, "maxParsingRequestsPerMin": integer, "globalMaxParsingRequestsPerMin": integer } |