This document describes how to use the Compute Engine remote Model Context Protocol (MCP) server to connect to Compute Engine from AI applications such as Gemini CLI, agent mode in Gemini Code Assist, Claude Code, or in AI applications that you're developing.
Model Context Protocol (MCP) standardizes the way large language models (LLMs) and AI applications or agents connect to outside data sources. MCP servers let you use their tools, resources, and prompts to take actions and get updated data from their backend service.
Local MCP servers typically run on your local machine and use the standard input and output streams (stdio) for communication between services on the same device. Remote MCP servers run on the service's infrastructure and offer an HTTP endpoint to AI applications for communication between the AI MCP client and the MCP server. For more information on MCP architecture, see MCP architecture.
Google and Google Cloud remote MCP servers have the following features and benefits:
- Simplified, centralized discovery.
- Managed global or regional HTTP endpoints.
- Fine-grained authorization.
- Optional prompt and response security with Model Armor protection.
- Centralized audit logging.
For information about other MCP servers and the security and governance controls that are available for Google Cloud MCP servers, see Google Cloud MCP servers overview.
Before you begin
- Sign in to your Google Cloud account. If you're new to Google Cloud, create an account to evaluate how our products perform in real-world scenarios. New customers also get $300 in free credits to run, test, and deploy workloads.
-
In the Google Cloud console, on the project selector page, select or create a Google Cloud project.
Roles required to select or create a project
- Select a project: Selecting a project doesn't require a specific IAM role—you can select any project that you've been granted a role on.
-
Create a project: To create a project, you need the Project Creator role
(
roles/resourcemanager.projectCreator), which contains theresourcemanager.projects.createpermission. Learn how to grant roles.
-
Verify that billing is enabled for your Google Cloud project.
-
Make sure that you have the following role or roles on the project: Compute Instance Admin (v1), Compute Security Admin, Service Account User, Service Usage Admin
Check for the roles
-
In the Google Cloud console, go to the IAM page.
Go to IAM - Select the project.
-
In the Principal column, find all rows that identify you or a group that you're included in. To learn which groups you're included in, contact your administrator.
- For all rows that specify or include you, check the Role column to see whether the list of roles includes the required roles.
Grant the roles
-
In the Google Cloud console, go to the IAM page.
Go to IAM - Select the project.
- Click Grant access.
-
In the New principals field, enter your user identifier. This is typically the email address for a Google Account.
- Click Select a role, then search for the role.
- To grant additional roles, click Add another role and add each additional role.
- Click Save.
-
-
In the Google Cloud console, on the project selector page, select or create a Google Cloud project.
Roles required to select or create a project
- Select a project: Selecting a project doesn't require a specific IAM role—you can select any project that you've been granted a role on.
-
Create a project: To create a project, you need the Project Creator role
(
roles/resourcemanager.projectCreator), which contains theresourcemanager.projects.createpermission. Learn how to grant roles.
-
Verify that billing is enabled for your Google Cloud project.
-
Make sure that you have the following role or roles on the project: Compute Instance Admin (v1), Compute Security Admin, Service Account User, Service Usage Admin
Check for the roles
-
In the Google Cloud console, go to the IAM page.
Go to IAM - Select the project.
-
In the Principal column, find all rows that identify you or a group that you're included in. To learn which groups you're included in, contact your administrator.
- For all rows that specify or include you, check the Role column to see whether the list of roles includes the required roles.
Grant the roles
-
In the Google Cloud console, go to the IAM page.
Go to IAM - Select the project.
- Click Grant access.
-
In the New principals field, enter your user identifier. This is typically the email address for a Google Account.
- Click Select a role, then search for the role.
- To grant additional roles, click Add another role and add each additional role.
- Click Save.
-
- Enable the Compute Engine API.
Required roles
To get the permissions that
you need to to enable the Compute Engine remote MCP server,
ask your administrator to grant you the
Service Usage Admin (roles/serviceusage.serviceUsageAdmin)
IAM role on your Google Cloud project.
For more information about granting roles, see Manage access to projects, folders, and organizations.
You might also be able to get the required permissions through custom roles or other predefined roles.
Roles for using the service
To get the permission that
you need to to make calls to the remote MCP server tools,
ask your administrator to grant you the
MCP Tool User (roles/mcp.toolUser)
IAM role on your Google Cloud project.
For more information about granting roles, see Manage access to projects, folders, and organizations.
This predefined role contains the
mcp.tools.call
permission,
which is required to
to make calls to the remote MCP server tools.
You might also be able to get this permission with custom roles or other predefined roles.
You also need the roles and permissions required to perform the Compute Engine operations. For more information, see Compute Engine roles and permissions.
Enable or disable the Compute Engine MCP server
You can enable or disable the Compute Engine MCP server
in a project with the gcloud beta services mcp enable command. For more
information, see the following sections.
Enable the Compute Engine MCP server in a project
If you are using different projects for your client credentials, such as service account keys, OAuth client ID or API keys, and for hosting your resources, then you must enable the Compute Engine service and the Compute Engine remote MCP server on both projects.
To enable the Compute Engine MCP server in your Google Cloud project, run the following command:
gcloud beta services mcp enable SERVICE \
--project=PROJECT_ID
Replace the following:
PROJECT_ID: the Google Cloud project ID.SERVICE: the global or regional service name for Compute Engine. For example,compute.googleapis.com. For available regions, see the Compute Engine MCP reference.
The Compute Engine remote MCP server is enabled for use in your Google Cloud project. If the Compute Engine service isn't enabled for your Google Cloud project, you are prompted to enable the service before enabling the Compute Engine remote MCP server.
As a security best practice, we recommend that you enable MCP servers only for the services required for your AI application to function.
Disable the Compute Engine MCP server in a project
To disable the Compute Engine MCP server in your Google Cloud project, run the following command:
gcloud beta services mcp disable SERVICE \
--project=PROJECT_ID
The Compute Engine MCP server is disabled for use in your Google Cloud Project.
Authentication and authorization
Compute Engine MCP servers use the OAuth 2.0 protocol with Identity and Access Management (IAM) for authentication and authorization. All Google Cloud identities are supported for authentication to MCP servers.
We recommend that you create a separate identity for agents that are using MCP tools so that access to resources can be controlled and monitored. For more information on authentication, see Authenticate to MCP servers.
Compute Engine MCP OAuth scopes
OAuth 2.0 uses scopes and credentials to determine if an authenticated principal is authorized to take a specific action on a resource. For more information about OAuth 2.0 scopes at Google, read Using OAuth 2.0 to access Google APIs.
Compute Engine has the following MCP tool OAuth scopes:
| Scope URI for gcloud CLI | Description |
|---|---|
https://www.googleapis.com/auth/compute.read-only |
Only allows access to read data. |
https://www.googleapis.com/auth/compute.read-write |
Allows access to read and modify data. |
Additional scopes might be required on the resources accessed during a tool call. To view a list of scopes required for Compute Engine, see Compute Engine API.
Configure an MCP client to use the Compute Engine MCP server
Host programs, such as Claude or Gemini CLI, can instantiate MCP clients that connect to a single MCP server. A host program can have multiple clients that connect to different MCP servers. To connect to a remote MCP server, the MCP client must know the remote MCP server's URL.
The method that you use to connect to a remote MCP server varies depending on
the host program that you use. For example, you can use the
gemini mcp remote add command to add a remote MCP server in Gemini
CLI. You are prompted to enter details about the server, such as its name and
URL.
For the Compute Engine MCP server, enter the following as required:
- Server name: Compute Engine MCP server
- Server URL or Endpoint: compute.googleapis.com/mcp
- Transport: HTTP
- Authentication details: Depending on how you want to authenticate, you can enter your Google Cloud credentials, your OAuth Client ID and secret, or an agent identity and credentials. For more information on authentication, see Authenticate to MCP servers.
For host-specific guidance about setting up and connecting to MCP server, see the following:
For more general guidance, see Connect to remote MCP servers.
Available tools
MCP Tools that are read-only have
the MCP attribute mcp.tool.isReadOnly set to true. You might want to only
allow read-only tools in certain environments through your
organization policy.
To view details of available MCP tools and their descriptions for the Compute Engine MCP server, see the Compute Engine MCP reference.
List tools
Use the MCP inspector to list tools, or send a
tools/list HTTP request directly to the Compute Engine
remote MCP server. The tools/list method doesn't require authentication.
POST /mcp HTTP/1.1
Host: compute.googleapis.com
Content-Type: application/json
{
"jsonrpc": "2.0",
"method": "tools/list",
}
Example use cases
The following sample use cases describe how you can use the Compute Engine MCP server to manage Compute Engine resources:
- Inspect and manage resources. For example, to understand resource allocation and configuration in your project, you can list all compute instances. You can also find all running compute instances in a zone that have a specific accelerator attached, and show their location and name for resource management.
- Clean up unused resources to reduce operational costs. For example, identify and clean up disk snapshots in a zone that are no longer associated with a source disk, or identify and delete stopped VM instances that have costly GPU resources attached.
- Optimize instance performance. For example, resize an under-provisioned VM instance to a larger machine type in the same family, and confirm the successful update.
- Provision specialized VMs for AI workloads with zone flexibility. For example, create a VM instance with a specific GPU accelerator attached, in any zone in a specified region where it is available.
- Troubleshoot and validate instance configurations. For example, retrieve configuration details for a specific VM instance where the job is frozen, reboot it, and confirm the underlying accelerator and disk are attached.
Sample prompts
The following are sample prompts that you can use to perform tasks by using the Compute Engine MCP server:
- List all VMs in
PROJECT_ID, including the VM name and zone. - Show the instance details for
VM_NAME. - In
REGION, find all disk snapshots for which the source disk no longer exists. - Change the machine type of
VM_NAMEto the next largest machine type in the same machine family, send notification when it's back online, and confirm the new machine type. - Find all running VMs in
REGIONwith NVIDIA accelerators, and show the zone and name for these VMs. - Create a VM in
ZONEwith an NVIDIA T4 accelerator attached. Name the VMmy-nvidiat4-vm. - Find all stopped VMs in
REGIONwith NVIDIA Tesla T4 accelerators, and delete them.
Replace the following:
PROJECT_ID: the Google Cloud project ID.REGION: the name of the region where your resources exist.ZONE: the name of the zone where your VMs exist.VM_NAME: the name of your VM instance.
Optional security and safety configurations
MCP introduces new security risks and considerations due to the wide variety of actions that you can do with the MCP tools. To minimize and manage these risks, Google Cloud offers default settings and customizable policies to control the use of MCP tools in your Google Cloud organization or project.
For more information about MCP security and governance, see AI security and safety.
Use Model Armor
Model Armor is a Google Cloud service that's designed to enhance the security and safety of your AI applications. It works by proactively screening LLM prompts and responses, protecting against various risks and supporting responsible AI practices. Whether you are deploying AI in your cloud environment, or on external cloud providers, Model Armor can help you prevent malicious input, verify content safety, protect sensitive data, maintain compliance, and enforce your AI safety and security policies consistently across your diverse AI landscape.
Model Armor is only available in specific regional locations. If Model Armor is enabled for a project, and a call to that project comes from an unsupported region, Model Armor makes a cross-regional call. For more information, see Model Armor locations.
Enable Model Armor
To enable Model Armor, complete the following steps:
To enable Model Armor on your Google Cloud project, run the following gcloud CLI command:
gcloud services enable modelarmor.googleapis.com \ --project=PROJECT_IDReplace
PROJECT_IDwith your Google Cloud project ID.To configure the recommended floor settings for Model Armor, run the following gcloud CLI command:
gcloud model-armor floorsettings update \ --full-uri='projects/PROJECT_ID/locations/global/floorSetting' \ --mcp-sanitization=ENABLED \ --malicious-uri-filter-settings-enforcement=ENABLED \ --pi-and-jailbreak-filter-settings-enforcement=ENABLED \ --pi-and-jailbreak-filter-settings-confidence-level=MEDIUM_AND_ABOVEReplace
PROJECT_IDwith your Google Cloud project ID.Model Armor is configured to scan for malicious URLs and prompt injection and jailbreak attempts.
For more information about configurable Model Armor filters, see Model Armor filters.
To add Model Armor as a content security provider for MCP services, run the following gcloud CLI command:
gcloud beta services mcp content-security add modelarmor.googleapis.com \ --project=PROJECT_IDReplace
PROJECT_IDwith the Google Cloud project ID.To confirm that MCP traffic is sent to Model Armor, run the following command:
gcloud beta services mcp content-security get \ --project=PROJECT_IDReplace
PROJECT_IDwith the Google Cloud project ID.
Model Armor logging
For information about Model Armor audit and platform logs, see Model Armor audit logging.
Disable Model Armor in a project
To disable Model Armor on a Google Cloud project, run the following command:
gcloud beta services mcp content-security remove modelarmor.googleapis.com \
--project=PROJECT_ID
Replace PROJECT_ID with the
Google Cloud project ID.
MCP traffic on Google Cloud won't be scanned by Model Armor for the specified project.
Disable scanning MCP traffic with Model Armor
If you want to keep using Model Armor in a project, but you want to stop scanning MCP traffic with Model Armor, then run the following command:
gcloud model-armor floorsettings update \
--full-uri='projects/PROJECT_ID/locations/global/floorSetting' \
--mcp-sanitization=DISABLED
Replace PROJECT_ID with the
Google Cloud project ID.
Model Armor won't scan MCP traffic on Google Cloud.
Organization level MCP control
You can create custom organization policies to control the use of MCP servers
in your Google Cloud organization by using the
gcp.managed.allowedMCPService constraint. For more information and
example use cases, see
Google Cloud MCP servers Access control with IAM.
What's next
- Read the Compute Engine MCP reference documentation.
- Learn more about Google Cloud MCP servers.