Tool: get_dataset_info
Get metadata information about a BigQuery dataset.
The following sample demonstrates how to use curl to invoke the get_dataset_info MCP tool.
| Curl Request |
|---|
curl --location 'https://bigquery.googleapis.com/mcp' --header 'content-type: application/json' --header 'accept: application/json, text/event-stream' --data '{ "method": "tools/call", "params": { "name": "get_dataset_info", "arguments": { "projectId": "PROJECT_ID", "datasetId": "DATASET_ID" } }, "jsonrpc": "2.0", "id": 1 }' |
Input Schema
Request for a dataset.
| JSON representation |
|---|
{ "projectId": string, "datasetId": string } |
| Fields | |
|---|---|
projectId |
Required. Project ID of the dataset request. |
datasetId |
Required. Dataset ID of the dataset request. |
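The request body used in the curl sample can also be assembled programmatically. The following is a minimal sketch, not an official client: the helper name `build_get_dataset_info_request` and the project and dataset IDs are placeholders invented for illustration. It only builds the JSON-RPC payload and does not send it.

```python
import json


def build_get_dataset_info_request(project_id: str, dataset_id: str, request_id: int = 1) -> dict:
    """Build a JSON-RPC 2.0 `tools/call` payload for get_dataset_info."""
    # Both arguments are marked Required in the input schema.
    if not project_id or not dataset_id:
        raise ValueError("projectId and datasetId are both required")
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "get_dataset_info",
            "arguments": {"projectId": project_id, "datasetId": dataset_id},
        },
    }


# Placeholder identifiers, for illustration only.
payload = build_get_dataset_info_request("my-project", "my_dataset")
body = json.dumps(payload)  # string suitable for an HTTP POST body
```

The resulting `body` string corresponds to the value passed to `--data` in the curl sample.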
Output Schema
Represents a BigQuery dataset.
| JSON representation |
|---|
{ "kind": string, "etag": string, "id": string, "selfLink": string, "datasetReference": { object ( |
| Fields | |
|---|---|
kind |
Output only. The resource type. |
etag |
Output only. A hash of the resource. |
id |
Output only. The fully-qualified unique name of the dataset in the format projectId:datasetId. The dataset name without the project name is given in the datasetId field. When creating a new dataset, leave this field blank, and instead specify the datasetId field. |
selfLink |
Output only. A URL that can be used to access the resource again. You can use this URL in Get or Update requests to the resource. |
datasetReference |
Required. A reference that identifies the dataset. |
friendlyName |
Optional. A descriptive name for the dataset. |
description |
Optional. A user-friendly description of the dataset. |
defaultTableExpirationMs |
Optional. The default lifetime of all tables in the dataset, in milliseconds. The minimum lifetime value is 3600000 milliseconds (one hour). To clear an existing default expiration with a PATCH request, set to 0. Once this property is set, all newly-created tables in the dataset will have an expirationTime property set to the creation time plus the value in this property, and changing the value will only affect new tables, not existing ones. When the expirationTime for a given table is reached, that table will be deleted automatically. If a table's expirationTime is modified or removed before the table expires, or if you provide an explicit expirationTime when creating a table, that value takes precedence over the default expiration time indicated by this property. |
defaultPartitionExpirationMs |
The default partition expiration, expressed in milliseconds. When new time-partitioned tables are created in a dataset where this property is set, the table inherits this value. |
labels |
The labels associated with this dataset. You can use these to organize and group your datasets. You can set this property when inserting or updating a dataset. See Creating and Updating Dataset Labels for more information. An object containing a list of "key": value pairs. |
access[] |
Optional. An array of objects that define dataset access for one or more entities. You can set this property when inserting or updating a dataset in order to control who is allowed to access the data. If unspecified at dataset creation time, BigQuery adds default dataset access for the following entities: access.specialGroup: projectReaders with access.role: READER; access.specialGroup: projectWriters with access.role: WRITER; access.specialGroup: projectOwners with access.role: OWNER; access.userByEmail: [dataset creator email] with access.role: OWNER. If you patch a dataset, then this field is overwritten by the patched dataset's access field. To add entities, you must supply the entire existing access array in addition to any new entities that you want to add. |
creationTime |
Output only. The time when this dataset was created, in milliseconds since the epoch. |
lastModifiedTime |
Output only. The time when this dataset was last modified, in milliseconds since the epoch. |
location |
The geographic location where the dataset should reside. See https://cloud.google.com/bigquery/docs/locations for supported locations. |
defaultEncryptionConfiguration |
The default encryption key for all tables in the dataset. After this property is set, the encryption key of all newly-created tables in the dataset is set to this value unless the table creation request or query explicitly overrides the key. |
satisfiesPzs |
Output only. Reserved for future use. |
satisfiesPzi |
Output only. Reserved for future use. |
type |
Output only. Same as
|
linkedDatasetSource |
Optional. The source dataset reference when the dataset is of type LINKED. For all other dataset types it is not set. This field cannot be updated once it is set. Any attempt to update this field using Update and Patch API Operations will be ignored. |
linkedDatasetMetadata |
Output only. Metadata about the LinkedDataset. Filled out when the dataset type is LINKED. |
externalDatasetReference |
Optional. Reference to a read-only external dataset defined in data catalogs outside of BigQuery. Filled out when the dataset type is EXTERNAL. |
externalCatalogDatasetOptions |
Optional. Options defining open source compatible datasets living in the BigQuery catalog. Contains metadata of open source database, schema or namespace represented by the current dataset. |
isCaseInsensitive |
Optional. TRUE if the dataset and its table names are case-insensitive, otherwise FALSE. By default, this is FALSE, which means the dataset and its table names are case-sensitive. This field does not affect routine references. |
defaultCollation |
Optional. Defines the default collation specification of future tables created in the dataset. If a table is created in this dataset without table-level default collation, then the table inherits the dataset default collation, which is applied to the string fields that do not have explicit collation specified. A change to this field affects only tables created afterwards, and does not alter the existing tables. The following values are supported:
|
defaultRoundingMode |
Optional. Defines the default rounding mode specification of new tables created within this dataset. During table creation, if this field is specified, the table within this dataset will inherit the default rounding mode of the dataset. Setting the default rounding mode on a table overrides this option. Existing tables in the dataset are unaffected. If columns are defined during that table creation, they will immediately inherit the table's default rounding mode, unless otherwise specified. |
maxTimeTravelHours |
Optional. Defines the time travel window in hours. The value can be from 48 to 168 hours (2 to 7 days). The default value is 168 hours if this is not set. |
tags[] |
Output only. Tags for the dataset. To provide tags as inputs, use the resourceTags field. |
storageBillingModel |
Optional. Updates storage_billing_model for the dataset. |
restrictions |
Optional. Output only. Restriction config for all tables and dataset. If set, restrict certain accesses on the dataset and all its tables based on the config. See Data egress for more details. |
resourceTags |
Optional. The tags attached to this dataset. Tag keys are globally unique. Tag key is expected to be in the namespaced format, for example "123456789012/environment" where 123456789012 is the ID of the parent organization or project resource for this tag key. Tag value is expected to be the short name, for example "Production". See Tag definitions for more details. An object containing a list of "key": value pairs. |
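Several of the fields above encode values that usually need decoding on the client: `id` combines project and dataset, and the time fields are milliseconds since the epoch. The sketch below shows one way to handle them. The function names are hypothetical, and it assumes that int64 fields may arrive either as JSON numbers or as numeric strings.

```python
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def split_dataset_id(full_id: str) -> tuple[str, str]:
    """Split an `id` of the form projectId:datasetId.

    Splits on the last colon, on the assumption that a project ID may
    itself contain a colon while a dataset ID may not.
    """
    project_id, sep, dataset_id = full_id.rpartition(":")
    if not sep or not project_id or not dataset_id:
        raise ValueError(f"not in projectId:datasetId form: {full_id!r}")
    return project_id, dataset_id


def epoch_ms_to_datetime(value) -> datetime:
    """Convert milliseconds since the epoch (int or numeric string) to UTC."""
    return EPOCH + timedelta(milliseconds=int(value))


def default_table_expiration(table_creation_time_ms, default_table_expiration_ms) -> datetime:
    """Expiration a new table would inherit: creation time plus the dataset default."""
    return epoch_ms_to_datetime(int(table_creation_time_ms) + int(default_table_expiration_ms))
```

For example, a table created at epoch millisecond 0 in a dataset whose `defaultTableExpirationMs` is 3600000 (the minimum, one hour) would get an `expirationTime` one hour after the epoch.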
| JSON representation |
|---|
{ "datasetId": string, "projectId": string, "datasetIdAlternative": [ string ], "projectIdAlternative": [ string ] } |
| Fields | |
|---|---|
datasetId |
Required. A unique ID for this dataset, without the project name. The ID must contain only letters (a-z, A-Z), numbers (0-9), or underscores (_). The maximum length is 1,024 characters. |
projectId |
Optional. The ID of the project containing this dataset. |
datasetIdAlternative[] |
This field should not be used. |
projectIdAlternative[] |
This field should not be used. |
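The `datasetId` constraints above (letters, digits, and underscores only, at most 1,024 characters) can be checked locally before making a request. This is a minimal sketch with a hypothetical helper name; the minimum length of one character is an assumption, since the text only states a maximum.

```python
import re

# Letters, digits, and underscores; 1 to 1,024 characters (minimum is assumed).
_DATASET_ID_RE = re.compile(r"[A-Za-z0-9_]{1,1024}")


def is_valid_dataset_id(dataset_id: str) -> bool:
    """Return True if dataset_id satisfies the documented character and length rules."""
    return _DATASET_ID_RE.fullmatch(dataset_id) is not None
```

A check like this only mirrors the documented rules; the service remains the authority on whether an ID is accepted.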
| JSON representation |
|---|
{ "value": string } |
| Fields | |
|---|---|
value |
The string value. |
| JSON representation |
|---|
{ "value": string } |
| Fields | |
|---|---|
value |
The int64 value. |
| JSON representation |
|---|
{ "key": string, "value": string } |
| Fields | |
|---|---|
key |
|
value |
|
| JSON representation |
|---|
{ "role": string, "userByEmail": string, "groupByEmail": string, "domain": string, "specialGroup": string, "iamMember": string, "view": { object ( |
| Fields | |
|---|---|
role |
An IAM role ID that should be granted to the user, group, or domain specified in this access entry. The following legacy mappings will be applied:
This field will accept any of the above formats, but will return only the legacy format. For example, if you set this field to "roles/bigquery.dataOwner", it will be returned back as "OWNER". |
userByEmail |
[Pick one] An email address of a user to grant access to. For example: fred@example.com. Maps to IAM policy member "user:EMAIL" or "serviceAccount:EMAIL". |
groupByEmail |
[Pick one] An email address of a Google Group to grant access to. Maps to IAM policy member "group:GROUP". |
domain |
[Pick one] A domain to grant access to. Any users signed in with the domain specified will be granted the specified access. Example: "example.com". Maps to IAM policy member "domain:DOMAIN". |
specialGroup |
[Pick one] A special group to grant access to. Possible values include:
Maps to similarly-named IAM members. |
iamMember |
[Pick one] Some other type of member that appears in the IAM Policy but isn't a user, group, domain, or special group. |
view |
[Pick one] A view from a different dataset to grant access to. Queries executed against that view will have read access to views/tables/routines in this dataset. The role field is not required when this field is set. If that view is updated by any user, access to the view needs to be granted again via an update operation. |
routine |
[Pick one] A routine from a different dataset to grant access to. Queries executed against that routine will have read access to views/tables/routines in this dataset. Only UDF is supported for now. The role field is not required when this field is set. If that routine is updated by any user, access to the routine needs to be granted again via an update operation. |
dataset |
[Pick one] A grant authorizing all resources of a particular type in a particular dataset access to this dataset. Only views are supported for now. The role field is not required when this field is set. If that dataset is deleted and re-created, its access needs to be granted again via an update operation. |
condition |
Optional. Condition for the binding. If the CEL expression in this field is true, this access binding will be considered
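Two behaviors of access entries are easy to get wrong: the `role` field is returned in its legacy form, and adding an entity requires resubmitting the whole existing `access` array. The sketch below illustrates both. The helper names are hypothetical, and the role table is an assumption: only the `roles/bigquery.dataOwner` to `OWNER` pair appears in the text above, and the other two pairs are filled in from BigQuery's commonly documented legacy mapping.

```python
# Assumed IAM-role -> legacy-role pairs. Only the OWNER pair is stated in
# the text above; the WRITER and READER pairs are assumptions.
LEGACY_ROLE_BY_IAM_ROLE = {
    "roles/bigquery.dataOwner": "OWNER",
    "roles/bigquery.dataEditor": "WRITER",
    "roles/bigquery.dataViewer": "READER",
}


def to_legacy_role(role: str) -> str:
    """Return the legacy form of a role, or the role unchanged if unmapped."""
    return LEGACY_ROLE_BY_IAM_ROLE.get(role, role)


def access_with_added_entry(dataset: dict, new_entry: dict) -> list:
    """Return the full access array: all existing entries plus new_entry.

    The input dataset is not modified. The result is what a patch would
    need to carry, since a patch overwrites the access field.
    """
    access = list(dataset.get("access", []))
    if new_entry not in access:
        access.append(new_entry)
    return access
```

Because the service returns legacy role names, comparing a role you set with one you read back is safer after normalizing both through `to_legacy_role`.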
| JSON representation |
|---|
{ "projectId": string, "datasetId": string, "tableId": string, "projectIdAlternative": [ string ], "datasetIdAlternative": [ string ], "tableIdAlternative": [ string ] } |
| Fields | |
|---|---|
projectId |
Required. The ID of the project containing this table. |
datasetId |
Required. The ID of the dataset containing this table. |
tableId |
Required. The ID of the table. The ID can contain Unicode characters in category L (letter), M (mark), N (number), Pc (connector, including underscore), Pd (dash), and Zs (space). For more information, see General Category. The maximum length is 1,024 characters. Certain operations allow suffixing of the table ID with a partition decorator, such as |
projectIdAlternative[] |
This field should not be used. |
datasetIdAlternative[] |
This field should not be used. |
tableIdAlternative[] |
This field should not be used. |
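The `tableId` rule is expressed in Unicode general categories rather than an ASCII character set, so a plain `[A-Za-z0-9_]` pattern would be too strict. A minimal sketch using Python's `unicodedata` module follows; the helper name is hypothetical, the length is counted in code points as an assumption, and partition decorators are deliberately not handled.

```python
import unicodedata

_ALLOWED_MAJOR_CLASSES = {"L", "M", "N"}       # letter, mark, number
_ALLOWED_CATEGORIES = {"Pc", "Pd", "Zs"}       # connector, dash, space


def is_valid_table_id(table_id: str) -> bool:
    """Check a bare table ID (no partition decorator) against the documented rule."""
    if not 1 <= len(table_id) <= 1024:
        return False
    for ch in table_id:
        category = unicodedata.category(ch)
        if category[0] not in _ALLOWED_MAJOR_CLASSES and category not in _ALLOWED_CATEGORIES:
            return False
    return True
```

Under this check, IDs containing underscores, hyphens, spaces, or accented letters pass, while punctuation such as `/` is rejected.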
| JSON representation |
|---|
{ "projectId": string, "datasetId": string, "routineId": string } |
| Fields | |
|---|---|
projectId |
Required. The ID of the project containing this routine. |
datasetId |
Required. The ID of the dataset containing this routine. |
routineId |
Required. The ID of the routine. The ID must contain only letters (a-z, A-Z), numbers (0-9), or underscores (_). The maximum length is 256 characters. |
| JSON representation |
|---|
{ "dataset": { object ( |
| Fields | |
|---|---|
dataset |
The dataset this entry applies to. |
targetTypes[] |
Which resources in the dataset this entry applies to. Currently, only views are supported, but additional target types may be added in the future. |
| JSON representation |
|---|
{ "expression": string, "title": string, "description": string, "location": string } |
| Fields | |
|---|---|
expression |
Textual representation of an expression in Common Expression Language syntax. |
title |
Optional. Title for the expression, that is, a short string describing its purpose. This can be used, for example, in UIs that allow entering the expression. |
description |
Optional. Description of the expression. This is a longer text that describes the expression, for example when hovering over it in a UI. |
location |
Optional. String indicating the location of the expression for error reporting, e.g. a file name and a position in the file. |
| JSON representation |
|---|
{ "kmsKeyName": string } |
| Fields | |
|---|---|
kmsKeyName |
Optional. Describes the Cloud KMS encryption key that will be used to protect the destination BigQuery table. The BigQuery Service Account associated with your project requires access to this encryption key. |
| JSON representation |
|---|
{ "value": boolean } |
| Fields | |
|---|---|
value |
The bool value. |
| JSON representation |
|---|
{
"sourceDataset": {
object ( |
| Fields | |
|---|---|
sourceDataset |
The source dataset reference contains project numbers and not project IDs. |
| JSON representation |
|---|
{
"linkState": enum ( |
| Fields | |
|---|---|
linkState |
Output only. Specifies whether Linked Dataset is currently in a linked state or not. |
| JSON representation |
|---|
{ "externalSource": string, "connection": string } |
| Fields | |
|---|---|
externalSource |
Required. External source that backs this dataset. |
connection |
Required. The connection ID that is used to access the external_source. Format: projects/{project_id}/locations/{location_id}/connections/{connection_id} |
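The `connection` value follows a fixed resource-name pattern, so its parts can be extracted with a regular expression. This is a minimal sketch with a hypothetical helper name and placeholder values; it assumes that no path segment contains a slash.

```python
import re

_CONNECTION_RE = re.compile(
    r"projects/(?P<project_id>[^/]+)"
    r"/locations/(?P<location_id>[^/]+)"
    r"/connections/(?P<connection_id>[^/]+)"
)


def parse_connection(name: str) -> dict:
    """Split a connection resource name into its three components."""
    match = _CONNECTION_RE.fullmatch(name)
    if match is None:
        raise ValueError(f"unexpected connection format: {name!r}")
    return match.groupdict()


# Placeholder resource name, for illustration only.
parts = parse_connection("projects/my-project/locations/us/connections/my-conn")
```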
| JSON representation |
|---|
{ "parameters": { string: string, ... }, "defaultStorageLocationUri": string } |
| Fields | |
|---|---|
parameters |
Optional. A map of key-value pairs defining the parameters and properties of the open source schema. Maximum size of 2 MiB. An object containing a list of "key": value pairs. |
defaultStorageLocationUri |
Optional. The storage location URI for all tables in the dataset. Equivalent to the Hive metastore's database locationUri. Maximum length of 1,024 characters. |
| JSON representation |
|---|
{ "key": string, "value": string } |
| Fields | |
|---|---|
key |
|
value |
|
| JSON representation |
|---|
{ "tagKey": string, "tagValue": string } |
| Fields | |
|---|---|
tagKey |
Required. The namespaced friendly name of the tag key, for example "12345/environment", where 12345 is the organization ID. |
tagValue |
Required. The friendly short name of the tag value, e.g. "production". |
| JSON representation |
|---|
{
"type": enum ( |
| Fields | |
|---|---|
type |
Output only. Specifies the type of dataset/table restriction. |
| JSON representation |
|---|
{ "key": string, "value": string } |
| Fields | |
|---|---|
key |
|
value |
|
Tool Annotations
Destructive Hint: ❌ | Idempotent Hint: ✅ | Read Only Hint: ✅ | Open World Hint: ❌