Grounding with Parallel on Vertex AI is a Separate Offering (as defined in your Google Cloud Agreement) that connects Gemini models to public web data provided by Parallel Web Systems' search API. This service gives Gemini access to live information from billions of web pages to ensure more up-to-date and factual responses.
Use cases
Grounding with Parallel is suitable for use cases such as the following:
- Using web data for information completion or enrichment purposes.
- Multi-hop agents that require deeper web searches for more complex questions.
- Building an API that integrates web search data.
- Employee-facing assistants that allow end users to ask questions and create complex reports using the latest web information.
- Consumer-facing applications, like retail and travel, that allow the user to make informed purchase decisions.
- Automated agents that run autonomously to perform automated tasks, like news analysis and KYC checks.
- Vertical agents (sales agents, coding agents, finance agents) getting the latest up-to-date context from the web.
Example
Who won the 2025 Las Vegas F1 Grand Prix?
| Without Grounding | With Grounding |
|---|---|
| The 2025 Las Vegas Grand Prix has not happened yet. The race is scheduled to take place on the weekend of November 20-22, 2025. Therefore, the winner is currently unknown. | The winner of the 2025 Las Vegas F1 Grand Prix was Max Verstappen of Red Bull Racing. The race took place on November 22, 2025. Sources: domain1.com, domain2.com, ... |
Supported models
This section lists the models that support Grounding with Parallel Web Search
Before you begin
To use Grounding with Parallel Web Search, you must set up your access. You have two options:
Subscribe directly to Grounding with Parallel Web Search on Google Cloud Marketplace (Preview) for a streamlined integration within your existing cloud environment.
Use an existing Parallel API key.
Subscribe on Google Cloud Marketplace (recommended)
Integrating through Google Cloud Marketplace lets you manage your Grounding with Parallel Web Search service directly within Google Cloud. To get started, subscribe to the Grounding with Parallel Web Search service on Google Cloud Marketplace, accept the terms of service, and review the pricing.
Bring your own API key
If you prefer to manage your Parallel billing and API access separately, you can provide your own API key. To use this method, you need to get an API key from the Parallel developer platform. This API key is used directly in your REST API requests to Gemini.
Ground Gemini responses with Parallel
Request grounded responses from Gemini by using the REST API as follows. For best performance, we recommend using default settings for optional parameters unless you strictly require non-default values.
If you subscribed to Grounding with Parallel Web Search on Google Cloud Marketplace (Preview), verify that the billing account used for your subscription is active in your Google Cloud project.
REST
Before using any of the request data, make the following replacements:
- LOCATION: The region to process the request. To use the global endpoint, exclude the location from the endpoint name and configure the location of the resource to `global`.
- PROJECT_ID: Your Google Cloud project ID.
- MODEL_ID: The ID of the model to use.
- TEXT: The text prompt to send to the model.
- API_KEY: Your API key for Parallel Web Search. If you specify an API key and are also subscribed to Grounding with Parallel Web Search on Google Cloud Marketplace (Preview), the API key takes precedence.
- EXCLUDE_DOMAINS: Optional: List of domains to exclude from grounding sources. If specified, sources from these domains are excluded. Acceptable values are domains (www.example.com) or domain extensions starting with a period ( .gov, .edu, .co.uk). You can specify up to 10 domains.
- INCLUDE_DOMAINS: Optional: List of domains to include in grounding sources. If specified, sources from these domains are included. Acceptable values are domains (www.example.com) or domain extensions starting with a period ( .gov, .edu, .co.uk). You can specify up to 10 domains.
- MAX_CHARS_PER_RESULT: Optional: The
maximum number of characters to include in each search result excerpt. If not specified, defaults
to
30000. The allowed range is[1000, 100000]. - MAX_CHARS_TOTAL: Optional: The maximum total
characters from all search result excerpts. If not specified, defaults to
100000. The allowed range is[1000, 1000000]. - MAX_RESULTS: Optional: The maximum number of search
results to use for grounding. If not specified, defaults to
10. The allowed range is[1, 20].
HTTP method and URL:
POST https://LOCATION-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/LOCATION/publishers/google/models/MODEL_ID:generateContent
Request JSON body:
{
"contents": [{
"role": "user",
"parts": [{
"text": "TEXT"
}]
}],
"tools": [{
"parallelAiSearch": {
"api_key": "API_KEY",
"customConfigs": {
"source_policy": {
"exclude_domains": ["EXCLUDE_DOMAINS"],
"include_domains": ["INCLUDE_DOMAINS"],
},
"excerpts": {
"max_chars_per_result": MAX_CHARS_PER_RESULT,
"max_chars_total": MAX_CHARS_TOTAL
},
"max_results": MAX_RESULTS
}
}
}],
"model": "projects/PROJECT_ID/locations/LOCATION/publishers/google/models/MODEL_ID"
}
To send your request, expand one of these options:
You should receive a JSON response similar to the following.
Quota
The default quota is 200 prompts per minute. To request an increase to your rate limits, provide your use case and requirements to the appropriate contact:
If you subscribed to Grounding with Parallel Web Search on Google Cloud Marketplace (Preview): Contact your Google account team.
If you provide your own API key: Contact support@parallel.ai and your Google account team.
Billing
The use of Grounding with Parallel Web Search incurs the following charges:
Gemini token consumption: You can be billed for:
- Prompt tokens
- Thinking tokens
- Output tokens
For more information, see Pricing.
Gemini's Grounding with your data: For more information, see Pricing.
Pricing for the use of the Parallel Web Search API:
If you subscribed to Grounding with Parallel Web Search on Google Cloud Marketplace (Preview), see the Parallel pricing section on Google Cloud Marketplace for more information.
If you provide your own API key, see the Parallel website for more information.