This tool is used to search and retrieve information from unstructured documents or FAQ content by connecting to a data store populated with your own files.
When you first create the tool, you provide:
- Name: A descriptive name that helps the AI understand the tool's job. Names should start with a verb (for example, search_internal_docs or query_product_manuals).
- Description: (Optional) An explanation of what the tool does and when the AI should use it. This helps the model decide whether this specific data store contains the answer to a user's prompt.
- Mock tool response: An optional configuration used to simulate the tool's output for testing purposes before the data is fully indexed.
- Location: The region where the data store is hosted (for example, global).
Data type: The format of the content you are importing:
- Unstructured data: Best for documents like PDFs, HTML files, or text files.
- FAQ: Best for structured question-and-answer pairs.
Select a folder or a file: The path to your data in Google Cloud Storage (gs://*). You can choose to import a single File or an entire Folder.
Synchronization frequency: How often the tool checks for updates in your source files.
- One time: A single import of the current data.
- Periodic: Automatically updates the data store when source files change (this setting cannot be changed after creation).
FAQ data store
FAQ data stores can hold answers to frequently asked questions (FAQ). When user questions are matched with high confidence to an uploaded question, the agent returns the answer to that question without any modification. You can provide a title and a URL for each question and answer pair that is displayed by the agent.
Data must be uploaded to the data store in CSV format. Each file must have a
header row describing the columns.
For example:
"question","answer","title","url"
"Why is the sky blue?","The sky is blue because of Rayleigh scattering.","Rayleigh scattering","https://en.wikipedia.org/wiki/Rayleigh_scattering"
"What is the meaning of life?","42","",""
The title and url columns are optional and can be omitted:
"answer","question"
"42","What is the meaning of life?"
During the upload process, you can select a folder where each file is treated as
a CSV file regardless of extension.
Limitations:
- An extra space character after a comma (,) causes an error.
- Blank lines (even at the end of the file) cause an error.
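The format rules and limitations above can be checked locally before you upload a file. The following is a minimal sketch, not part of the product: the names check_faq_csv and has_space_after_comma are hypothetical, and the checks cover only the rules listed here (a header row containing question and answer, no space after a comma, no blank lines). It assumes that fields do not span multiple lines and that a single trailing newline ends the last row rather than counting as a blank line.

```python
import csv

# Columns that every FAQ file needs; title and url are optional.
REQUIRED_COLUMNS = {"question", "answer"}


def has_space_after_comma(line: str) -> bool:
    """Return True if a comma outside double quotes is followed by a space."""
    in_quotes = False
    for i, ch in enumerate(line):
        if ch == '"':
            # Doubled quotes ("") toggle twice, so the state stays correct.
            in_quotes = not in_quotes
        elif ch == "," and not in_quotes and line[i + 1 : i + 2] == " ":
            return True
    return False


def check_faq_csv(text: str) -> list[str]:
    """Return a list of problems found in FAQ CSV text (empty if none)."""
    problems = []
    lines = text.split("\n")
    # Assumption: one trailing newline terminates the last row and is
    # not itself a blank line.
    if lines[-1] == "":
        lines.pop()
    if not lines:
        return ["file is empty: a header row is required"]

    # Parse the first line as the header row and look for required columns.
    header = next(csv.reader([lines[0]]))
    missing = REQUIRED_COLUMNS - set(header)
    if missing:
        problems.append(f"header is missing columns: {sorted(missing)}")

    for number, line in enumerate(lines, start=1):
        if line.strip() == "":
            problems.append(f"line {number}: blank line")
        elif has_space_after_comma(line):
            problems.append(f"line {number}: space after a comma")
    return problems


if __name__ == "__main__":
    sample = '"answer","question"\n"42", "What is the meaning of life?"\n'
    for problem in check_faq_csv(sample):
        print(problem)
```

Running the checks before upload surfaces formatting problems earlier than the import step does. The import process itself remains the authority on what is accepted.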
Unstructured data store
Unstructured data stores can contain content in the following formats:
- HTML
- PDF
- TXT
- CSV
It's possible (but uncommon) to import files from another project's Cloud Storage bucket. To do this, you need to grant explicit access to the import process. Follow the instructions in the error message, which will contain the name of the user that needs read access to the bucket in order to perform the import.
Limitations:
- The maximum file size is 2.5 MB for text-based formats and 100 MB for other formats.
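The size limits can be applied to a list of files before import. This is a rough sketch under stated assumptions, not the import process's own logic: it assumes that HTML, TXT, and CSV count as text-based and PDF as "other", and that 1 MB means 1,000,000 bytes. The names size_limit_for and is_within_limit are hypothetical.

```python
from pathlib import PurePosixPath

MB = 1_000_000  # Assumption: decimal megabytes.
TEXT_LIMIT = int(2.5 * MB)
OTHER_LIMIT = 100 * MB

# Assumption: these extensions are the "text-based" formats.
TEXT_EXTENSIONS = {".html", ".txt", ".csv"}
SUPPORTED_EXTENSIONS = TEXT_EXTENSIONS | {".pdf"}


def size_limit_for(name: str) -> int:
    """Return the assumed size limit in bytes for a file name or gs:// path."""
    extension = PurePosixPath(name).suffix.lower()
    if extension not in SUPPORTED_EXTENSIONS:
        raise ValueError(f"unsupported format: {extension or '(none)'}")
    return TEXT_LIMIT if extension in TEXT_EXTENSIONS else OTHER_LIMIT


def is_within_limit(name: str, size_bytes: int) -> bool:
    """Return True if the file size does not exceed the assumed limit."""
    return size_bytes <= size_limit_for(name)
```

For example, is_within_limit("manual.pdf", 50 * MB) passes under these assumptions, while a 3 MB TXT file does not.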
See also Import from Cloud Storage.