Use the GKE remote MCP server

Applies to: Autopilot and Standard clusters

Model Context Protocol (MCP) standardizes the way large language models (LLMs) and AI applications or agents connect to outside data sources. MCP servers let you use their tools, resources, and prompts to take actions and get updated data from their backend service.

Local MCP servers typically run on your local machine and use the standard input and output streams (stdio) for communication between services on the same device. Remote MCP servers run on the service's infrastructure and offer an HTTP endpoint to AI applications for communication between the AI MCP client and the MCP server. For more information on MCP architecture, see MCP architecture.
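As a rough illustration of the difference between the two transports, the following sketch frames the same JSON-RPC 2.0 message both ways: as a newline-delimited line for stdio, and as the body of an HTTP POST for a remote server. The helper names and the example URL are hypothetical, and the HTTP framing is simplified; see the MCP specification for the full transport rules.

```python
import json


def make_request(method, request_id, params=None):
    """Build a JSON-RPC 2.0 request object, the message format MCP uses."""
    message = {"jsonrpc": "2.0", "id": request_id, "method": method}
    if params is not None:
        message["params"] = params
    return message


def frame_for_stdio(message):
    """Local servers: one JSON message per line on stdin/stdout."""
    return json.dumps(message) + "\n"


def frame_for_http(message, url):
    """Remote servers: the same message becomes the body of an HTTP POST."""
    return {
        "url": url,
        "method": "POST",
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps(message).encode("utf-8"),
    }


# "ping" is a parameterless MCP request, so it keeps the example small.
request = make_request("ping", 1)
stdio_line = frame_for_stdio(request)
http_call = frame_for_http(request, "https://example.com/mcp")  # placeholder URL
```

In both cases the payload is identical; only the way it travels between the client and the server changes.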

This document describes how to use the GKE remote Model Context Protocol (MCP) server to connect to GKE from AI applications such as Gemini CLI, agent mode in Gemini Code Assist, Claude Code, or in AI applications you're developing.

For information on the GKE local MCP server, see GKE MCP server on GitHub.

Google and Google Cloud remote MCP servers have the following features and benefits:

Simplified, centralized discovery.
Managed global or regional HTTP endpoints.
Fine-grained authorization.
Optional prompt and response security with Model Armor protection.
Centralized audit logging.

For information about other MCP servers and information about security and governance controls available for Google Cloud MCP servers, see Google Cloud MCP servers overview.

You might want to use the GKE local MCP server for the following reasons:

Local development and testing
Offline MCP use
Cluster and workload creation, including manifest generation for AI/ML workloads
Local client configuration (using kubeconfig)
Query logs
Get cost and security recommendations for your GKE environment

For more information about how to use our local MCP server, see GKE MCP server. The following sections only apply to the GKE remote MCP server.

Before you begin

Sign in to your Google Cloud account. If you're new to Google Cloud, create an account to evaluate how our products perform in real-world scenarios. New customers also get $300 in free credits to run, test, and deploy workloads.

Install the Google Cloud CLI.

Note: If you installed the gcloud CLI previously, make sure you have the latest version by running gcloud components update.

To initialize the gcloud CLI, run the following command:

gcloud init

Create or select a Google Cloud project.

Roles required to select or create a project

Select a project: Selecting a project doesn't require a specific IAM role—you can select any project that you've been granted a role on.
Create a project: To create a project, you need the Project Creator role (roles/resourcemanager.projectCreator), which contains the resourcemanager.projects.create permission. Learn how to grant roles.

Create a Google Cloud project:
```
gcloud projects create PROJECT_ID
```
Replace PROJECT_ID with a name for the Google Cloud project you are creating.
Select the Google Cloud project that you created:
```
gcloud config set project PROJECT_ID
```
Replace PROJECT_ID with your Google Cloud project name.

Verify that billing is enabled for your Google Cloud project.

Enable the Kubernetes Engine API:

Roles required to enable APIs

To enable APIs, you need the Service Usage Admin IAM role (roles/serviceusage.serviceUsageAdmin), which contains the serviceusage.services.enable permission. Learn how to grant roles.

gcloud services enable container.googleapis.com

Grant roles to your user account. Run the following command once for each of the following IAM roles: roles/container.clusterViewer

gcloud projects add-iam-policy-binding PROJECT_ID --member="user:USER_IDENTIFIER" --role=ROLE

Replace the following:

PROJECT_ID: Your project ID.
USER_IDENTIFIER: The identifier for your user account. For example, myemail@example.com.
ROLE: The IAM role that you grant to your user account.

Required roles

To perform the one-time setup of enabling the GKE remote MCP server, an administrator needs the following roles:

Service Usage Admin (roles/serviceusage.serviceUsageAdmin): Grant this role on your Google Cloud project to allow enabling the remote MCP service endpoint. This role includes the serviceusage.mcppolicy.get and serviceusage.mcppolicy.update permissions.

For more information about granting roles, see Manage access to projects, folders, and organizations.

Roles for using the service

The principal that makes calls to the remote MCP server tools needs permissions to access GKE resources. This principal can be a human user or an automated service account. At a minimum, grant the following roles on your Google Cloud project:

MCP Tool User (roles/mcp.toolUser): Grants permission to make tool calls to the MCP server endpoint.
Kubernetes Engine Cluster Viewer (roles/container.clusterViewer): Provides the read-only access needed for the remote server's tools.

Grant these roles to:

A user account when a person is interacting with the MCP server through a client like the Gemini CLI.
A service account when building an autonomous agent or application that calls the MCP server.
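The following sketch shows one way to generate the role bindings described above for either kind of principal. It only builds and prints the gcloud projects add-iam-policy-binding commands; the project ID and service account email are placeholder values, and the helper name is hypothetical.

```python
import shlex

# Roles named in this section for principals that call the remote MCP server.
REQUIRED_ROLES = ["roles/mcp.toolUser", "roles/container.clusterViewer"]


def binding_commands(project_id, principal, is_service_account=False):
    """Return one gcloud invocation (as an argv list) per required role."""
    member_type = "serviceAccount" if is_service_account else "user"
    member = f"{member_type}:{principal}"
    return [
        [
            "gcloud", "projects", "add-iam-policy-binding", project_id,
            f"--member={member}",
            f"--role={role}",
        ]
        for role in REQUIRED_ROLES
    ]


# Placeholder project and service account; substitute your own values.
commands = binding_commands(
    "my-project",
    "agent@my-project.iam.gserviceaccount.com",
    is_service_account=True,
)
for argv in commands:
    # Print the command for review. To execute it instead, pass argv to
    # subprocess.run(argv, check=True).
    print(shlex.join(argv))
```

Printing the commands first lets an administrator review each binding before it is applied.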

Enable or disable the GKE remote MCP server

You can enable or disable the GKE remote MCP server in a project with the gcloud beta services mcp enable and gcloud beta services mcp disable commands. For more information, see the following sections.

Enable the GKE remote MCP server in a project

If you use one project for your client credentials (such as service account keys, OAuth client IDs, or API keys) and a different project for hosting your resources, you must enable the GKE service and the GKE remote MCP server in both projects.

To enable the GKE remote MCP server in your Google Cloud project, run the following command:

gcloud beta services mcp enable container.googleapis.com \
    --project=PROJECT_ID

Replace PROJECT_ID with the Google Cloud project ID.

The GKE remote MCP server is now enabled for use in your Google Cloud project. If the GKE service isn't enabled for the project, you are prompted to enable the service before the GKE remote MCP server is enabled.

As a security best practice, we recommend that you enable MCP servers only for the services required for your AI application to function.

Disable the GKE remote MCP server in a project

To disable the GKE remote MCP server in your Google Cloud project, run the following command:

gcloud beta services mcp disable container.googleapis.com \
    --project=PROJECT_ID

Replace PROJECT_ID with the Google Cloud project ID.

The GKE remote MCP server is disabled for use in your Google Cloud project.

Authentication and authorization

GKE remote MCP servers use the OAuth 2.0 protocol with Identity and Access Management (IAM) for authentication and authorization. All Google Cloud identities are supported for authentication to MCP servers.

The GKE remote MCP server does not accept API keys for authentication.

We recommend creating a separate identity for agents using MCP tools so that access to resources can be controlled and monitored. For more information on authentication, see Authenticate to MCP servers.
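As a minimal sketch of what OAuth 2.0 authentication can look like from client code, the following example gets an access token from the gcloud CLI and places it in a standard bearer Authorization header. It assumes that the gcloud CLI is installed and signed in, and that the endpoint accepts the token as an OAuth 2.0 bearer credential; the function names are hypothetical.

```python
import subprocess


def get_access_token():
    """Ask the gcloud CLI for an OAuth 2.0 access token for the active account."""
    result = subprocess.run(
        ["gcloud", "auth", "print-access-token"],
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout.strip()


def auth_headers(token):
    """Return HTTP headers that present the token as a bearer credential."""
    return {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }


# Placeholder token for illustration; in practice, pass get_access_token().
headers = auth_headers("PLACEHOLDER_TOKEN")
```

Access tokens are short-lived, so a long-running agent needs to refresh the token rather than cache it indefinitely.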

GKE remote MCP OAuth scopes

OAuth 2.0 uses scopes and credentials to determine if an authenticated principal is authorized to take a specific action on a resource. For more information about OAuth 2.0 scopes at Google, read Using OAuth 2.0 to access Google APIs.

GKE has the following MCP tool OAuth scopes:

Scope URI for gcloud CLI	Description
`https://www.googleapis.com/auth/cloud-platform`	Grants broad, read-only access to your Google Cloud projects.

Additional scopes might be required on the resources accessed during a tool call. To view a list of scopes required for GKE, see GKE API.

Configure an MCP client to use the GKE MCP server

Host programs, such as Claude or Gemini CLI, can instantiate MCP clients, each of which connects to a single MCP server. A host program can have multiple clients that connect to different MCP servers. To connect to a remote MCP server, the MCP client must know, at a minimum, the URL of the remote MCP server.

In your host, look for a way to connect to a remote MCP server. You are prompted to enter details about the server, such as its name and URL.

For the GKE remote MCP server, enter the following as required:

Server name: GKE remote MCP server
Server URL or Endpoint: https://container.googleapis.com/mcp
Transport: HTTP
Authentication details: Depending on how you want to authenticate, you can enter your Google Cloud credentials, your OAuth Client ID and secret, or an agent identity and credentials. For more information on authentication, see Authenticate to MCP servers.

For host-specific guidance, see your host program's documentation.

For more general guidance, see Connect to remote MCP servers.
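To make the connection details above concrete, the following sketch assembles the server name, URL, and transport into a JSON configuration entry. The layout (an mcpServers map with url and transport keys) is a hypothetical schema chosen for illustration, not the format of any particular host; check your host's documentation for the key names it expects. Authentication details are omitted.

```python
import json

GKE_MCP_URL = "https://container.googleapis.com/mcp"


def build_server_entry(name, url, transport="http"):
    """Return a config fragment describing one remote MCP server.

    The key names are illustrative assumptions, not a real host's schema.
    """
    return {"mcpServers": {name: {"url": url, "transport": transport}}}


# "gke-remote-mcp" is an arbitrary label for the server entry.
config = build_server_entry("gke-remote-mcp", GKE_MCP_URL)
print(json.dumps(config, indent=2))
```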

Available tools

To view details of available MCP tools and their descriptions for the GKE MCP server, see the GKE MCP reference.

List tools

Use the MCP inspector to list tools, or send a tools/list HTTP request directly to the GKE remote MCP server. The tools/list method doesn't require authentication.

POST /mcp HTTP/1.1
Host: container.googleapis.com
Content-Type: application/json

{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "tools/list"
}
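The same tools/list request can be issued from code. The following sketch uses only the Python standard library. It assumes that the server replies with a plain JSON body; if your server streams its response instead, the parsing step needs to change. The function names are hypothetical.

```python
import json
import urllib.request

MCP_URL = "https://container.googleapis.com/mcp"


def build_tools_list_request(url=MCP_URL, request_id=1):
    """Build (but don't send) a JSON-RPC 2.0 tools/list request."""
    payload = {"jsonrpc": "2.0", "id": request_id, "method": "tools/list"}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def list_tools():
    """Send the request and return the parsed response (needs network access)."""
    with urllib.request.urlopen(build_tools_list_request(), timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))


# Building the request has no side effects; call list_tools() to send it.
request = build_tools_list_request()
```

The id field lets the client match the response to the request, which matters once several requests are in flight.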

Sample use cases

The following are sample use cases for the GKE remote MCP server:

Inspect the configuration and status of your GKE clusters and node pools. For example, use the prompt: "Show me the details of my 'production-cluster' and list all of its node pools."
View Kubernetes resource configurations and container logs from within a cluster without using kubectl. For example, use the prompt: "Get the YAML for the 'frontend-deployment' in the 'default' namespace."
Monitor the status of long-running GKE operations, such as cluster upgrades. For example, use the prompt: "List all the GKE operations in my project from the last hour."
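Behind prompts like these, the AI application sends tools/call requests to the server. The following sketch shows the general shape of such a request as defined by MCP. The tool name list_clusters and its parent argument are hypothetical placeholders; use the tools/list output or the GKE MCP reference for the real tool names and input schemas.

```python
import json


def build_tool_call(tool_name, arguments, request_id=1):
    """Build a JSON-RPC 2.0 tools/call request for an MCP server."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Hypothetical tool name and argument, for illustration only.
call = build_tool_call(
    "list_clusters",
    {"parent": "projects/my-project/locations/-"},
    request_id=2,
)
print(json.dumps(call, indent=2))
```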

Optional security and safety configurations

MCP introduces new security risks and considerations due to the wide variety of actions that can be taken with MCP tools. To minimize and manage these risks, Google Cloud offers defaults and customizable policies to control the use of MCP tools in your Google Cloud organization or project.

For more information about MCP security and governance, see AI security and safety.

Use Model Armor

Model Armor is a Google Cloud service designed to enhance the security and safety of your AI applications. It works by proactively screening LLM prompts and responses, protecting against various risks and supporting responsible AI practices. Whether you are deploying AI in your cloud environment, or on external cloud providers, Model Armor can help you prevent malicious input, verify content safety, protect sensitive data, maintain compliance, and enforce your AI safety and security policies consistently across your diverse AI landscape.

Model Armor is only available in specific regional locations. If Model Armor is enabled for a project, and a call to that project comes from an unsupported region, Model Armor makes a cross-regional call. For more information, see Model Armor locations.

Enable Model Armor

You must enable Model Armor APIs before you can use Model Armor.

Console

Enable the Model Armor API.
Roles required to enable APIs
To enable APIs, you need the Service Usage Admin IAM role (roles/serviceusage.serviceUsageAdmin), which contains the serviceusage.services.enable permission. Learn how to grant roles.
Enable the API
Select the project where you want to activate Model Armor.

gcloud

Before you begin, follow these steps using the Google Cloud CLI with the Model Armor API:

In the Google Cloud console, activate Cloud Shell.

At the bottom of the Google Cloud console, a Cloud Shell session starts and displays a command-line prompt. Cloud Shell is a shell environment with the Google Cloud CLI already installed and with values already set for your current project. It can take a few seconds for the session to initialize.
Run the following command to set the API endpoint for the Model Armor service.
```
gcloud config set api_endpoint_overrides/modelarmor "https://modelarmor.LOCATION.rep.googleapis.com/"
```
Replace LOCATION with the region where you want to use Model Armor.

Configure protection for Google and Google Cloud remote MCP servers

You can protect your MCP tool calls and responses by using Model Armor floor settings. A floor setting defines the minimum security filters that apply across the project. This configuration applies a consistent set of filters to all MCP tool calls and responses within the project.

Set up a Model Armor floor setting with MCP sanitization enabled. For more information, see Configure Model Armor floor settings.

See the following example command:

gcloud model-armor floorsettings update \
  --full-uri='projects/PROJECT_ID/locations/global/floorSetting' \
  --enable-floor-setting-enforcement=TRUE \
  --add-integrated-services=GOOGLE_MCP_SERVER \
  --google-mcp-server-enforcement-type=INSPECT_AND_BLOCK \
  --enable-google-mcp-server-cloud-logging \
  --malicious-uri-filter-settings-enforcement=ENABLED \
  --add-rai-settings-filters='[{"confidenceLevel": "MEDIUM_AND_ABOVE", "filterType": "DANGEROUS"}]'

Replace PROJECT_ID with your Google Cloud project ID.

Note the following settings:

INSPECT_AND_BLOCK: The enforcement type that inspects content for the Google MCP server and blocks prompts and responses that match the filters.
ENABLED: The setting that enables a filter or enforcement.
MEDIUM_AND_ABOVE: The confidence level for the Responsible AI - Dangerous filter settings. You can modify this setting, though lower values might result in more false positives. For more information, see Model Armor confidence levels.

Disable scanning MCP traffic with Model Armor

If you want to stop scanning Google MCP traffic with Model Armor, run the following command:

gcloud model-armor floorsettings update \
  --full-uri='projects/PROJECT_ID/locations/global/floorSetting' \
  --remove-integrated-services=GOOGLE_MCP_SERVER

Replace PROJECT_ID with the Google Cloud project ID.

Model Armor won't scan MCP traffic in the project.

Control MCP use with IAM deny policies

Identity and Access Management (IAM) deny policies help you secure Google Cloud remote MCP servers. Configure these policies to block unwanted MCP tool access.

For example, you can deny or allow access based on:

The principal.
Tool properties like read-only.
The application's OAuth client ID.

For more information, see Control MCP use with Identity and Access Management.

What's next

Read the GKE remote MCP reference documentation.
Learn more about Google Cloud MCP servers.
