Use the Compute Engine remote MCP server

This document describes how to use the Compute Engine remote Model Context Protocol (MCP) server to connect to Compute Engine from AI applications such as Gemini CLI, agent mode in Gemini Code Assist, Claude Code, or in AI applications that you're developing.

Model Context Protocol (MCP) standardizes the way large language models (LLMs) and AI applications or agents connect to outside data sources. MCP servers let you use their tools, resources, and prompts to take actions and get updated data from their backend service.

Local MCP servers typically run on your local machine and use the standard input and output streams (stdio) for communication between services on the same device. Remote MCP servers run on the service's infrastructure and offer an HTTP endpoint to AI applications for communication between the AI MCP client and the MCP server. For more information on MCP architecture, see MCP architecture.

Google and Google Cloud remote MCP servers have the following features and benefits:

Simplified, centralized discovery.
Managed global or regional HTTP endpoints.
Fine-grained authorization.
Optional prompt and response security with Model Armor protection.
Centralized audit logging.

For information about other MCP servers and the security and governance controls that are available for Google Cloud MCP servers, see Google Cloud MCP servers overview.

Before you begin

Sign in to your Google Cloud account. If you're new to Google Cloud, create an account to evaluate how our products perform in real-world scenarios. New customers also get $300 in free credits to run, test, and deploy workloads.

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Roles required to select or create a project

Select a project: Selecting a project doesn't require a specific IAM role—you can select any project that you've been granted a role on.
Create a project: To create a project, you need the Project Creator role (roles/resourcemanager.projectCreator), which contains the resourcemanager.projects.create permission. Learn how to grant roles.

Go to project selector

Verify that billing is enabled for your Google Cloud project.

Make sure that you have the following role or roles on the project: Compute Instance Admin (v1), Compute Security Admin, Service Account User, Service Usage Admin

Check for the roles

In the Google Cloud console, go to the IAM page.
Go to IAM
Select the project.
In the Principal column, find all rows that identify you or a group that you're included in. To learn which groups you're included in, contact your administrator.
For all rows that specify or include you, check the Role column to see whether the list of roles includes the required roles.

Grant the roles

In the Google Cloud console, go to the IAM page.
Go to IAM
Select the project.
Click Grant access.
In the New principals field, enter your user identifier. This is typically the email address for a Google Account.
Click Select a role, then search for the role.
To grant additional roles, click Add another role and add each additional role.
Click Save.

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Roles required to select or create a project

Select a project: Selecting a project doesn't require a specific IAM role—you can select any project that you've been granted a role on.
Create a project: To create a project, you need the Project Creator role (roles/resourcemanager.projectCreator), which contains the resourcemanager.projects.create permission. Learn how to grant roles.

Go to project selector

Verify that billing is enabled for your Google Cloud project.

Make sure that you have the following role or roles on the project: Compute Instance Admin (v1), Compute Security Admin, Service Account User, Service Usage Admin

Check for the roles

In the Google Cloud console, go to the IAM page.
Go to IAM
Select the project.
In the Principal column, find all rows that identify you or a group that you're included in. To learn which groups you're included in, contact your administrator.
For all rows that specify or include you, check the Role column to see whether the list of roles includes the required roles.

Grant the roles

In the Google Cloud console, go to the IAM page.
Go to IAM
Select the project.
Click Grant access.
In the New principals field, enter your user identifier. This is typically the email address for a Google Account.
Click Select a role, then search for the role.
To grant additional roles, click Add another role and add each additional role.
Click Save.

Enable the Compute Engine API.
Enable the Compute Engine API

Required roles

To get the permissions that you need to to enable the Compute Engine remote MCP server, ask your administrator to grant you the Service Usage Admin (roles/serviceusage.serviceUsageAdmin) IAM role on your Google Cloud project. For more information about granting roles, see Manage access to projects, folders, and organizations.

You might also be able to get the required permissions through custom roles or other predefined roles.

Roles for using the service

To get the permission that you need to to make calls to the remote MCP server tools, ask your administrator to grant you the MCP Tool User (roles/mcp.toolUser) IAM role on your Google Cloud project. For more information about granting roles, see Manage access to projects, folders, and organizations.

This predefined role contains the mcp.tools.call permission, which is required to to make calls to the remote MCP server tools.

You might also be able to get this permission with custom roles or other predefined roles.

You also need the roles and permissions required to perform the Compute Engine operations. For more information, see Compute Engine roles and permissions.

Enable or disable the Compute Engine MCP server

You can enable or disable the Compute Engine MCP server in a project with the gcloud beta services mcp enable command. For more information, see the following sections.

Enable the Compute Engine MCP server in a project

If you are using different projects for your client credentials, such as service account keys, OAuth client ID or API keys, and for hosting your resources, then you must enable the Compute Engine service and the Compute Engine remote MCP server on both projects.

To enable the Compute Engine MCP server in your Google Cloud project, run the following command:

gcloud beta services mcp enable SERVICE \
    --project=PROJECT_ID

Replace the following:

PROJECT_ID: the Google Cloud project ID.
SERVICE: the global or regional service name for Compute Engine. For example, compute.googleapis.com. For available regions, see the Compute Engine MCP reference.

The Compute Engine remote MCP server is enabled for use in your Google Cloud project. If the Compute Engine service isn't enabled for your Google Cloud project, you are prompted to enable the service before enabling the Compute Engine remote MCP server.

As a security best practice, we recommend that you enable MCP servers only for the services required for your AI application to function.

Disable the Compute Engine MCP server in a project

To disable the Compute Engine MCP server in your Google Cloud project, run the following command:

gcloud beta services mcp disable SERVICE \
    --project=PROJECT_ID

The Compute Engine MCP server is disabled for use in your Google Cloud Project.

Authentication and authorization

Compute Engine MCP servers use the OAuth 2.0 protocol with Identity and Access Management (IAM) for authentication and authorization. All Google Cloud identities are supported for authentication to MCP servers.

We recommend that you create a separate identity for agents that are using MCP tools so that access to resources can be controlled and monitored. For more information on authentication, see Authenticate to MCP servers.

Compute Engine MCP OAuth scopes

OAuth 2.0 uses scopes and credentials to determine if an authenticated principal is authorized to take a specific action on a resource. For more information about OAuth 2.0 scopes at Google, read Using OAuth 2.0 to access Google APIs.

Compute Engine has the following MCP tool OAuth scopes:

Scope URI for gcloud CLI	Description
`https://www.googleapis.com/auth/compute.read-only`	Only allows access to read data.
`https://www.googleapis.com/auth/compute.read-write`	Allows access to read and modify data.

Additional scopes might be required on the resources accessed during a tool call. To view a list of scopes required for Compute Engine, see Compute Engine API.

Configure an MCP client to use the Compute Engine MCP server

Host programs, such as Claude or Gemini CLI, can instantiate MCP clients that connect to a single MCP server. A host program can have multiple clients that connect to different MCP servers. To connect to a remote MCP server, the MCP client must know the remote MCP server's URL.

The method that you use to connect to a remote MCP server varies depending on the host program that you use. For example, you can use the gemini mcp remote add command to add a remote MCP server in Gemini CLI. You are prompted to enter details about the server, such as its name and URL.

For the Compute Engine MCP server, enter the following as required:

Server name: Compute Engine MCP server
Server URL or Endpoint: compute.googleapis.com/mcp
Transport: HTTP
Authentication details: Depending on how you want to authenticate, you can enter your Google Cloud credentials, your OAuth Client ID and secret, or an agent identity and credentials. For more information on authentication, see Authenticate to MCP servers.

For host-specific guidance about setting up and connecting to MCP server, see the following:

For more general guidance, see Connect to remote MCP servers.

Available tools

MCP Tools that are read-only have the MCP attribute mcp.tool.isReadOnly set to true. You might want to only allow read-only tools in certain environments through your organization policy.

To view details of available MCP tools and their descriptions for the Compute Engine MCP server, see the Compute Engine MCP reference.

List tools

Use the MCP inspector to list tools, or send a tools/list HTTP request directly to the Compute Engine remote MCP server. The tools/list method doesn't require authentication.

POST /mcp HTTP/1.1
Host: compute.googleapis.com
Content-Type: application/json

{
  "jsonrpc": "2.0",
  "method": "tools/list",
}

Example use cases

The following sample use cases describe how you can use the Compute Engine MCP server to manage Compute Engine resources:

Inspect and manage resources. For example, to understand resource allocation and configuration in your project, you can list all compute instances. You can also find all running compute instances in a zone that have a specific accelerator attached, and show their location and name for resource management.
Clean up unused resources to reduce operational costs. For example, identify and clean up disk snapshots in a zone that are no longer associated with a source disk, or identify and delete stopped VM instances that have costly GPU resources attached.
Optimize instance performance. For example, resize an under-provisioned VM instance to a larger machine type in the same family, and confirm the successful update.
Provision specialized VMs for AI workloads with zone flexibility. For example, create a VM instance with a specific GPU accelerator attached, in any zone in a specified region where it is available.
Troubleshoot and validate instance configurations. For example, retrieve configuration details for a specific VM instance where the job is frozen, reboot it, and confirm the underlying accelerator and disk are attached.

Sample prompts

The following are sample prompts that you can use to perform tasks by using the Compute Engine MCP server:

List all VMs in PROJECT_ID, including the VM name and zone.
Show the instance details for VM_NAME.
In REGION, find all disk snapshots for which the source disk no longer exists.
Change the machine type of VM_NAME to the next largest machine type in the same machine family, send notification when it's back online, and confirm the new machine type.
Find all running VMs in REGION with NVIDIA accelerators, and show the zone and name for these VMs.
Create a VM in ZONE with an NVIDIA T4 accelerator attached. Name the VM my-nvidiat4-vm.
Find all stopped VMs in REGION with NVIDIA Tesla T4 accelerators, and delete them.

Replace the following:

PROJECT_ID: the Google Cloud project ID.
REGION: the name of the region where your resources exist.
ZONE: the name of the zone where your VMs exist.
VM_NAME: the name of your VM instance.

Optional security and safety configurations

MCP introduces new security risks and considerations due to the wide variety of actions that you can do with the MCP tools. To minimize and manage these risks, Google Cloud offers default settings and customizable policies to control the use of MCP tools in your Google Cloud organization or project.

For more information about MCP security and governance, see AI security and safety.

Use Model Armor

Model Armor is a Google Cloud service that's designed to enhance the security and safety of your AI applications. It works by proactively screening LLM prompts and responses, protecting against various risks and supporting responsible AI practices. Whether you are deploying AI in your cloud environment, or on external cloud providers, Model Armor can help you prevent malicious input, verify content safety, protect sensitive data, maintain compliance, and enforce your AI safety and security policies consistently across your diverse AI landscape.

Model Armor is only available in specific regional locations. If Model Armor is enabled for a project, and a call to that project comes from an unsupported region, Model Armor makes a cross-regional call. For more information, see Model Armor locations.

Enable Model Armor

To enable Model Armor, complete the following steps:

To enable Model Armor on your Google Cloud project, run the following gcloud CLI command:
```
gcloud services enable modelarmor.googleapis.com \
    --project=PROJECT_ID
```
Replace PROJECT_ID with your Google Cloud project ID.

To configure the recommended floor settings for Model Armor, run the following gcloud CLI command:

gcloud model-armor floorsettings update \
    --full-uri='projects/PROJECT_ID/locations/global/floorSetting' \
    --mcp-sanitization=ENABLED \
    --malicious-uri-filter-settings-enforcement=ENABLED \
    --pi-and-jailbreak-filter-settings-enforcement=ENABLED \
    --pi-and-jailbreak-filter-settings-confidence-level=MEDIUM_AND_ABOVE

Replace PROJECT_ID with your Google Cloud project ID.

Model Armor is configured to scan for malicious URLs and prompt injection and jailbreak attempts.

For more information about configurable Model Armor filters, see Model Armor filters.

To add Model Armor as a content security provider for MCP services, run the following gcloud CLI command:
```
gcloud beta services mcp content-security add modelarmor.googleapis.com \
    --project=PROJECT_ID
```
Replace PROJECT_ID with the Google Cloud project ID.
To confirm that MCP traffic is sent to Model Armor, run the following command:
```
gcloud beta services mcp content-security get \
    --project=PROJECT_ID
```
Replace PROJECT_ID with the Google Cloud project ID.

Model Armor logging

For information about Model Armor audit and platform logs, see Model Armor audit logging.

Disable Model Armor in a project

To disable Model Armor on a Google Cloud project, run the following command:

gcloud beta services mcp content-security remove modelarmor.googleapis.com \
    --project=PROJECT_ID

Replace PROJECT_ID with the Google Cloud project ID.

MCP traffic on Google Cloud won't be scanned by Model Armor for the specified project.

Disable scanning MCP traffic with Model Armor

If you want to keep using Model Armor in a project, but you want to stop scanning MCP traffic with Model Armor, then run the following command:

gcloud model-armor floorsettings update \
  --full-uri='projects/PROJECT_ID/locations/global/floorSetting' \
  --mcp-sanitization=DISABLED

Replace PROJECT_ID with the Google Cloud project ID.

Model Armor won't scan MCP traffic on Google Cloud.

Organization level MCP control

You can create custom organization policies to control the use of MCP servers in your Google Cloud organization by using the gcp.managed.allowedMCPService constraint. For more information and example use cases, see Google Cloud MCP servers Access control with IAM.

What's next

Read the Compute Engine MCP reference documentation.
Learn more about Google Cloud MCP servers.

Use the Compute Engine remote MCP server Stay organized with collections Save and categorize content based on your preferences.

Before you begin

Check for the roles

Grant the roles

Check for the roles

Grant the roles

Required roles

Roles for using the service

Enable or disable the Compute Engine MCP server

Enable the Compute Engine MCP server in a project

Disable the Compute Engine MCP server in a project

Authentication and authorization

Compute Engine MCP OAuth scopes

Configure an MCP client to use the Compute Engine MCP server

Available tools

List tools

Example use cases

Sample prompts

Optional security and safety configurations

Use Model Armor

Enable Model Armor

Model Armor logging

Disable Model Armor in a project

Disable scanning MCP traffic with Model Armor

Organization level MCP control

What's next

Use the Compute Engine remote MCP server