Google uses AI technology to translate content into your preferred language. AI translations can contain errors.

使用 Compute Engine 遠端 MCP 伺服器

本文說明如何使用 Compute Engine 遠端 Model Context Protocol (MCP) 伺服器，連線至 AI 應用程式，包括 Gemini CLI、ChatGPT、Claude 和您開發的自訂應用程式。Compute Engine 遠端 MCP 伺服器提供完整的功能，可讓 LLM 代理執行各種基礎架構管理工作，包括：

管理虛擬機器 (VM) 執行個體。
管理執行個體群組管理員和執行個體範本。
管理磁碟和快照。

擷取預訂和承諾使用合約的相關資訊。啟用 Compute Engine API 時，系統會啟用 Compute Engine 遠端 MCP 伺服器。

Model Context Protocol (MCP) 規範大型語言模型 (LLM) 和 AI 應用程式/代理程式連結外部資料來源的標準方式。MCP 伺服器可讓您使用工具、資源和提示，從後端服務採取行動及取得更新資料。

本機和遠端 MCP 伺服器有何不同？

本機 MCP 伺服器: 通常在本機電腦上執行，並使用標準輸入和輸出串流 (stdio)，在同一部裝置上的服務之間進行通訊。
遠端 MCP 伺服器: 在服務的基礎架構上執行，並為 AI 應用程式提供 HTTP 端點，供 AI MCP 用戶端與 MCP 伺服器之間的通訊。如要進一步瞭解 MCP 架構，請參閱 MCP 架構。

Google 和 Google Cloud 遠端 MCP 伺服器

Google 和 Google Cloud 遠端 MCP 伺服器具有下列功能和優點：

簡化集中式探索作業
代管全域或區域 HTTP 端點
精細授權
(選用) 使用 Model Armor 保護提示詞和回覆
集中式稽核記錄

如要瞭解其他 MCP 伺服器，以及 Google Cloud MCP 伺服器適用的安全性與控管措施，請參閱 Google Cloud MCP 伺服器總覽。

事前準備

登入 Google Cloud 帳戶。如果您是 Google Cloud新手，歡迎建立帳戶，親自評估產品在實際工作環境中的成效。新客戶還能獲得價值 $300 美元的免費抵免額，可用於執行、測試及部署工作負載。

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Roles required to select or create a project

Select a project: Selecting a project doesn't require a specific IAM role—you can select any project that you've been granted a role on.
Create a project: To create a project, you need the Project Creator role (roles/resourcemanager.projectCreator), which contains the resourcemanager.projects.create permission. Learn how to grant roles.

Go to project selector

Verify that billing is enabled for your Google Cloud project.

Make sure that you have the following role or roles on the project: Compute Instance Admin (v1), Compute Security Admin, Service Account User, Service Usage Admin

Check for the roles

In the Google Cloud console, go to the IAM page.
Go to IAM
Select the project.
In the Principal column, find all rows that identify you or a group that you're included in. To learn which groups you're included in, contact your administrator.
For all rows that specify or include you, check the Role column to see whether the list of roles includes the required roles.

Grant the roles

In the Google Cloud console, go to the IAM page.
Go to IAM
Select the project.
Click Grant access.
In the New principals field, enter your user identifier. This is typically the email address for a Google Account.
Click Select a role, then search for the role.
To grant additional roles, click Add another role and add each additional role.
Click Save.

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Roles required to select or create a project

Select a project: Selecting a project doesn't require a specific IAM role—you can select any project that you've been granted a role on.
Create a project: To create a project, you need the Project Creator role (roles/resourcemanager.projectCreator), which contains the resourcemanager.projects.create permission. Learn how to grant roles.

Go to project selector

Verify that billing is enabled for your Google Cloud project.

Make sure that you have the following role or roles on the project: Compute Instance Admin (v1), Compute Security Admin, Service Account User, Service Usage Admin

Check for the roles

In the Google Cloud console, go to the IAM page.
Go to IAM
Select the project.
In the Principal column, find all rows that identify you or a group that you're included in. To learn which groups you're included in, contact your administrator.
For all rows that specify or include you, check the Role column to see whether the list of roles includes the required roles.

Grant the roles

In the Google Cloud console, go to the IAM page.
Go to IAM
Select the project.
Click Grant access.
In the New principals field, enter your user identifier. This is typically the email address for a Google Account.
Click Select a role, then search for the role.
To grant additional roles, click Add another role and add each additional role.
Click Save.

啟用 Compute Engine API。
啟用 Compute Engine API

必要的角色

如要取得使用 Compute Engine 遠端 MCP 伺服器所需的權限，請要求管理員在 Google Cloud 專案中授予下列 IAM 角色：

發出 MCP 工具呼叫： MCP 工具使用者 (roles/mcp.toolUser)

如要進一步瞭解如何授予角色，請參閱「管理專案、資料夾和組織的存取權」。

這些預先定義的角色具備使用 Compute Engine 遠端 MCP 伺服器所需的權限。如要查看確切的必要權限，請展開「Required permissions」(必要權限) 部分：

所需權限

如要使用 Compute Engine 遠端 MCP 伺服器，您必須具備下列權限：

發出 MCP 工具呼叫： mcp.tools.call

您或許還可透過自訂角色或其他預先定義的角色取得這些權限。

您還需要執行 Compute Engine 作業所需的角色和權限。詳情請參閱「Compute Engine 角色和權限」。

驗證及授權

Compute Engine MCP 伺服器使用 OAuth 2.0 通訊協定搭配 Identity and Access Management (IAM) 進行驗證和授權。所有Google Cloud 身分都支援 MCP 伺服器驗證。

建議您為使用 MCP 工具的代理商建立個別身分，以便控管及監控資源存取權。如要進一步瞭解驗證，請參閱「向 MCP 伺服器進行驗證」。

Compute Engine MCP OAuth 範圍

OAuth 2.0 會使用範圍和憑證，判斷經過驗證的主體是否有權對資源執行特定動作。如要進一步瞭解 Google 的 OAuth 2.0 範圍，請參閱「使用 OAuth 2.0 存取 Google API」。

Compute Engine 具有下列 MCP 工具 OAuth 範圍：

gcloud CLI 的範圍 URI	說明
`https://www.googleapis.com/auth/compute.read-only`	僅允許讀取資料的權限。
`https://www.googleapis.com/auth/compute.read-write`	有權讀取及修改資料。

在工具呼叫期間存取的資源可能需要其他範圍。如要查看 Compute Engine 必要的範圍清單，請參閱 Compute Engine API。

設定 MCP 用戶端，以使用 Compute Engine MCP 伺服器

AI 應用程式和代理 (例如 Claude 或 Antigravity) 可以例項化 MCP 用戶端，並連至單一 MCP 伺服器。AI 應用程式可有多個用戶端，連至不同的 MCP 伺服器。如果您的應用程式未列在特定用戶端指南中，則可以使用下列資訊，從大多數應用程式連線。

在 AI 應用程式中，尋找新增或連線至遠端 MCP 伺服器的方法。如果是 Compute Engine MCP 伺服器，請視需要輸入下列資訊：

伺服器名稱：Compute Engine MCP 伺服器
伺服器網址或端點：https://compute.googleapis.com/mcp
傳輸：HTTP
驗證詳細資料：視驗證方式而定，您可以輸入 Google Cloud 憑證、OAuth 用戶端 ID 和密鑰，或是代理程式身分和憑證。如要進一步瞭解驗證，請參閱「向 MCP 伺服器進行驗證」。

如需設定及連線至 MCP 伺服器的應用程式專屬指引，請參閱用戶端專屬指引。

如需更多一般指引，請參閱下列資源：

可用的工具

如要查看 Compute Engine MCP 伺服器的可用 MCP 工具詳細資料和說明，請參閱 Compute Engine MCP 參考資料。

列出工具

使用 MCP 檢查器列出工具，或直接將 tools/list HTTP 要求傳送至 Compute Engine 遠端 MCP 伺服器。tools/list 方法不需要驗證。

POST /mcp HTTP/1.1
Host: compute.googleapis.com
Content-Type: application/json

{
  "jsonrpc": "2.0",
  "method": "tools/list",
}

應用實例

下列範例使用案例說明如何使用 Compute Engine MCP 伺服器管理 Compute Engine 資源：

檢查及管理資源。舉例來說，如要瞭解專案中的資源配置和設定，可以列出所有運算執行個體。您也可以找出區域中所有已附加特定加速器的執行中運算執行個體，並顯示其位置和名稱，方便您管理資源。
清理未使用的資源，降低營運成本。舉例來說，您可以找出並清理區域中不再與來源磁碟相關聯的磁碟快照，或是找出並刪除已停止且附加昂貴 GPU 資源的 VM 執行個體。
最佳化執行個體效能。舉例來說，將資源不足的 VM 執行個體調整為同一系列中較大的機型，並確認更新成功。
為 AI 工作負載佈建專用 VM，並享有可用區彈性。舉例來說，您可以在指定區域中提供 GPU 加速器的任何可用區，建立附加特定 GPU 加速器的 VM 執行個體。
排解及驗證執行個體設定。舉例來說，您可以擷取工作凍結的特定 VM 執行個體設定詳細資料、重新啟動該執行個體，並確認已附加基礎加速器和磁碟。

提示範例

以下是範例提示，可用於透過 Compute Engine MCP 伺服器執行工作：

「列出 PROJECT_ID 中的所有 VM，包括 VM 名稱和可用區。」
「Show the instance details for VM_NAME.」(顯示 VM_NAME 的執行個體詳細資料)。
「在 REGION 中，找出來源磁碟已不存在的所有磁碟快照。」
「將 VM_NAME 的機型變更為相同機器家族中下一個最大的機型，在恢復連線時傳送通知，並確認新的機型。」
「Find all running VMs in REGION with NVIDIA accelerators, and show the zone and name for these VMs.」(找出 REGION 中所有搭載 NVIDIA 加速器的執行中 VM，並顯示這些 VM 的區域和名稱)。
「在 ZONE 中建立 VM，並附加 NVIDIA T4 加速器。將 VM 命名為 my-nvidiat4-vm。
「Find all stopped VMs in REGION with NVIDIA Tesla T4 accelerators, and delete them.」(找出 REGION 中所有已停止的 VM，並刪除這些 VM，這些 VM 必須搭載 NVIDIA Tesla T4 加速器)。

更改下列內容：

PROJECT_ID： Google Cloud 專案 ID。
REGION：資源所在的區域名稱。
ZONE：VM 所在的可用區名稱。
VM_NAME：VM 執行個體的名稱。

選用的安全防護設定

由於 MCP 工具可執行各種動作，因此會帶來新的安全風險和考量。為盡量減少及管理這些風險，Google Cloud 提供預設設定和可自訂的政策，控管機構或專案中 MCP 工具的使用情形。 Google Cloud

如要進一步瞭解 MCP 安全性和控管措施，請參閱這篇文章。

使用 Model Armor

Model Armor 是一項Google Cloud 服務，可提高 AI 應用程式的安全性。這項功能會主動篩選 LLM 提示詞和回覆，防範各種風險並支援負責任的 AI 做法。無論您是在雲端環境或外部雲端供應商部署 AI，Model Armor 都能協助您防範惡意輸入、驗證內容安全性、保護私密資料、維持法規遵循狀態，並在多元的 AI 環境中，持續強制執行 AI 安全政策。

啟用 Model Armor 並啟用記錄功能後，Model Armor 會記錄整個酬載。這可能會導致記錄中出現私密資訊。

啟用 Model Armor

您必須先啟用 Model Armor API，才能使用 Model Armor。

控制台

啟用 Model Armor API。
啟用 API 時所需的角色
如要啟用 API，您需要具備服務使用情形管理員 IAM 角色 (roles/serviceusage.serviceUsageAdmin)，其中包含 serviceusage.services.enable 權限。瞭解如何授予角色。
啟用 API
選取要啟用 Model Armor 的專案。

gcloud

開始前，請使用 Google Cloud CLI 搭配 Model Armor API 執行下列步驟：

在 Google Cloud 控制台中啟用 Cloud Shell。

啟用 Cloud Shell

控制台底部會開啟 Cloud Shell 工作階段，並顯示指令列提示。 Google Cloud Cloud Shell 是已安裝 Google Cloud CLI 的殼層環境，並已針對您目前的專案設定好相關值。工作階段可能要幾秒鐘的時間才能初始化。
執行下列指令，為 Model Armor 服務設定 API 端點。
```
gcloud config set api_endpoint_overrides/modelarmor "https://modelarmor.LOCATION.rep.googleapis.com/"
```
將 LOCATION 替換為要使用 Model Armor 的區域。

設定 Google 和遠端 MCP 伺服器的防護功能 Google Cloud

如要保護 MCP 工具呼叫和回應，可以使用 Model Armor 底限設定。底限設定會定義專案適用的最低安全性篩選條件。這項設定會對專案中的所有 MCP 工具呼叫和回覆套用一致的篩選器。

設定啟用 MCP 消毒功能的 Model Armor 底限設定。詳情請參閱「設定 Model Armor 底價」。

請參閱下列指令範例：

gcloud model-armor floorsettings update \
--full-uri='projects/PROJECT_ID/locations/global/floorSetting' \
--enable-floor-setting-enforcement=TRUE \
--add-integrated-services=GOOGLE_MCP_SERVER \
--google-mcp-server-enforcement-type=INSPECT_AND_BLOCK \
--enable-google-mcp-server-cloud-logging \
--malicious-uri-filter-settings-enforcement=ENABLED \
--add-rai-settings-filters='[{"confidenceLevel": "MEDIUM_AND_ABOVE", "filterType": "DANGEROUS"}]'

將 PROJECT_ID 替換為 Google Cloud 專案 ID。

請注意下列設定：

INSPECT_AND_BLOCK：強制執行類型，可檢查 Google MCP 伺服器的內容，並封鎖符合篩選器的提示和回覆。
ENABLED：啟用篩選器或強制執行的設定。
MEDIUM_AND_ABOVE：負責任的 AI 技術 - 危險篩選器設定的信心水準。您可以修改這項設定，但較低的值可能會導致更多誤判。詳情請參閱「Model Armor 信心水準」。

停用使用 Model Armor 掃描 MCP 流量

如要停止讓 Model Armor 根據專案的底限設定，自動掃描往返 Google MCP 伺服器的流量，請執行下列指令：

gcloud model-armor floorsettings update \
  --full-uri='projects/PROJECT_ID/locations/global/floorSetting' \
  --remove-integrated-services=GOOGLE_MCP_SERVER

將 PROJECT_ID 替換為 Google Cloud 專案 ID。Model Armor 不會自動將這個專案底限設定中定義的規則，套用至任何 Google MCP 伺服器流量。

Model Armor 底限設定和一般設定不僅會影響 MCP，由於 Model Armor 會與 Vertex AI 等服務整合，因此您對底限設定所做的任何變更，都會影響所有整合服務 (不只是 MCP) 的流量掃描和安全行為。

使用 IAM 拒絕政策控管 MCP 使用情形

身分與存取權管理 (IAM) 拒絕政策可協助您保護 Google Cloud 遠端 MCP 伺服器。設定這些政策，即可封鎖不必要的 MCP 工具存取權。

舉例來說，您可以根據下列條件拒絕或允許存取：

主體
工具屬性 (例如唯讀)
應用程式的 OAuth 用戶端 ID

詳情請參閱「使用 Identity and Access Management 控制 MCP 使用情形」。

後續步驟

請參閱 Compute Engine MCP 參考說明文件。
進一步瞭解 Google Cloud MCP 伺服器。

使用 Compute Engine 遠端 MCP 伺服器 透過集合功能整理內容 你可以依據偏好儲存及分類內容。

本機和遠端 MCP 伺服器有何不同？

Google 和 Google Cloud 遠端 MCP 伺服器

事前準備

Check for the roles

Grant the roles

Check for the roles

Grant the roles

必要的角色

所需權限

驗證及授權

Compute Engine MCP OAuth 範圍

設定 MCP 用戶端，以使用 Compute Engine MCP 伺服器

可用的工具

列出工具

應用實例

提示範例

選用的安全防護設定

使用 Model Armor

啟用 Model Armor

控制台

gcloud

設定 Google 和遠端 MCP 伺服器的防護功能 Google Cloud

停用使用 Model Armor 掃描 MCP 流量

使用 IAM 拒絕政策控管 MCP 使用情形

後續步驟

使用 Compute Engine 遠端 MCP 伺服器