This tool is used to search and retrieve information from specific websites by connecting to a data store.
When you first create the tool, you provide:
Name: A descriptive name that helps the AI understand the tool's job. Names should start with a verb (for example, search_knowledge_base).
Description: (Optional) An explanation of what the tool does and when the AI should use it.
Location: The region where the data store is hosted (for example, global).
Mock tool response: An optional configuration used to simulate the tool's output for testing purposes before the data is fully indexed.
Sites to include: A list of URLs to index. Use
yoursite.com/*to index an entire site oryoursite.com/page/*to index a specific page.Sites to exclude: Specific URLs or patterns that the tool should ignore.
Limitations:
- You must verify your domain when using website content as a source.
- Files from public URLs must have been crawled by the Google Search indexer, so that they exist in the search index. You can check this with the Google Search Console.
- A maximum of 200,000 pages are indexed. If the data store contains more pages, indexing will fail at that point. Any content already indexed will remain.
See also Create a data store using website content.