Known issues

This page describes known issues that you might run into while using Compute Engine. For issues that specifically affect Confidential VMs, see Confidential VM limitations.

General issues

The following issues provide troubleshooting guidance or general information.

Resolved: Modifying the IOPS or throughput on an Asynchronous Replication primary disk using the gcloud compute disks update command causes a false error

The following issue was resolved on June 1, 2025.

When you use the gcloud compute disks update command to modify the IOPS and throughput on an Asynchronous Replication primary disk, the gcloud CLI shows an error message even if the update was successful.

To accurately verify that an update was successful, use the gcloud CLI or the Google Cloud console to see if the disk properties show the new IOPS and throughput values. For more information, see View the provisioned performance settings for Hyperdisk.

The metadata server might display old physicalHost VM metadata

After experiencing a host error which moves an instance to a new host, when you query the metadata server, it might display the physicalHost metadata of the instance's previous host.

To workaround this issue, do one of the following:

Long baseInstanceName values in managed instance groups (MIGs) can cause disk name conflicts

In a MIG, disk name conflicts can occur if the instance template specifies disks to be created upon VM creation and the baseInstanceName value exceeds 54 characters. This happens because Compute Engine generates disk names using the instance name as a prefix.

When generating disk names, if the resulting name exceeds the resource name limit of 63 characters, Compute Engine truncates the excess characters from the end of instance name. This truncation can lead to the creation of identical disk names for instances that have similar naming patterns. In such a case, the new instance will attempt to attach the existing disk. If the disk is already attached to another instance, the new instance creation fails. If the disk is not attached or is in multi-writer mode, the new instance will attach the disk, potentially leading to data corruption.

To avoid disk name conflicts, keep the baseInstanceName value to a maximum length of 54 characters.

Creating reservations or future reservation requests using an instance template that specifies an A2, C3, or G2 machine type causes issues

If you use an instance template that specifies an A2, C3, or G2 machine type to create a reservation, or to create and submit a future reservation request for review, you encounter issues. Specifically:

  • Creating the reservation might fail. If it succeeds, then one of the following applies:

    • If you created an automatically consumed reservation (default), creating VMs with matching properties won't consume the reservation.

    • If you created a specific reservation, creating VMs to specifically target the reservation fails.

  • Creating the future reservation request succeeds. However, if you submit it for review, Google Cloud declines your request.

You can't replace the instance template used to create a reservation or future reservation request, or override the template's VM properties. If you want to reserve resources for A2, C3, or G2 machine types, do one of the following instead:

Limitations when using -lssd machine types with Google Kubernetes Engine

When using the Google Kubernetes Engine API, the node pool with Local SSD attached that you provision must have the same number of SSD disks as the selected C4, C3, or C3D machine type. For example, if you plan to create a VM that uses the c3-standard-8-lssd there must be 2 SSD disks, whereas for a c3d-standard-8-lssd, just 1 SSD disk is required. If the disk number doesn't match you will get a Local SSD misconfiguration error from the Compute Engine control plane. See Machine types that automatically attach Local SSD disks to select the correct number of Local SSD disks based on the lssd machine type.

Using the Google Kubernetes Engine Google Cloud console to create a cluster or node pool with c4-standard-*-lssd, c4-highmem-*-lssd, c3-standard-*-lssd and c3d-standard-*-lssd VMs results in node creation failure or a failure to detect Local SSDs as ephemeral storage.

Single flow TCP throughput variability on C3D VMs

C3D VMs larger than 30 vCPUs might experience single flow TCP throughput variability and occasionally be limited to 20-25 Gbps. To achieve higher rates, use multiple tcp flows.

The CPU utilization observability metric is incorrect for VMs that use one thread per core

If your VM's CPU uses one thread per core, the CPU utilization Cloud Monitoring observability metric in the Compute Engine > VM instances > Observability tab only scales to 50%. Two threads per core is the default for all machine types, except Tau T2D. For more information, see Set number of threads per core.

To view your VM's CPU utilization normalized to 100%, view CPU utilization in Metrics Explorer instead. For more information, see Create charts with Metrics Explorer.

Google Cloud console SSH-in-browser connections might fail if you use custom firewall rules

If you use custom firewall rules to control SSH access to your VM instances, you might not be able to use the SSH-in-browser feature.

To work around this issue, do one of the following:

Temporary names for disks

During virtual machine (VM) instance updates initiated using the gcloud compute instances update command or the instances.update API method, Compute Engine might temporarily change the name of your VM's disks, by adding of the following suffixes to the original name:

  • -temp
  • -old
  • -new

Compute Engine removes the suffix and restores the original disk names as the update completes.

Increased latency for some Persistent Disks caused by disk resizing

In some cases, resizing large Persistent Disks (~3 TB or larger) might be disruptive to the I/O performance of the disk. If you are impacted by this issue, your Persistent Disks might experience increased latency during the resize operation. This issue can impact Persistent Disks of any type.

Your automated processes might fail if they use API response data about your resource-based commitment quotas

Your automated processes that consume and use API response data about your Compute Engine resource-based commitment quotas might fail if each of the following things happens. Your automated processes can include any snippets of code, business logic, or database fields that use or store the API responses.

  1. The response data is from any of the following Compute Engine API methods:

  2. You use an int instead of a number to define the field for your resource quota limit in your API response bodies. You can find the field in the following ways for each method:

  3. You have unlimited default quota available for any of your Compute Engine committed SKUs.

    For more information about quotas for commitments and committed SKUs, see Quotas for commitments and committed resources.

Root cause

When you have limited quota, if you define the items[].quotas[].limit or quotas[].limit field as an int type, the API response data for your quota limits might still fall within the range for int type and your automated process might not get disrupted. But when the default quota limit is unlimited, Compute Engine API returns a value for the limit field that falls outside of the range defined by int type. Your automated process can't consume the value returned by the API method and fails as a result.

How to work around this issue

You can work around this issue and continue generating your automated reports in the following ways:

  • Recommended: Follow the Compute Engine API reference documentation and use the correct data types for the API method definitions. Specifically, use the number type to define the items[].quotas[].limit and quotas[].limit fields for your API methods.

  • Decrease your quota limit to a value under 9,223,372,036,854,775,807. You must set quota caps for all projects that have resource-based commitments, across all regions. You can do this in one of the following ways:

Known issues for bare metal instances

These are the known issues for Compute Engine bare metal instances.

C4D bare metal doesn't support SUSE Linux 15 SP6 images

C4D bare metal instances can't run the SUSE Linux Enterprise Server (SLES) version 15 SP6 OS.

Workaround

Use SLES 15 SP5 instead.

Simulating host maintenance doesn't work for C4 bare metal instances

The c4-standard-288-metal and c4-highmem-288-metal machine types don't support simulating host maintenance events.

Workaround

You can use VM instances created using other C4 machine types to simulate maintenance events.

  1. Create a VM instance using a C4 machine type that doesn't end in -metal.

    When creating the VM instance, configure the C4 VM to Terminate instead of using Live Migration during host maintenance events.

  2. Simulate a host maintenance event for this VM.

During a simulated host maintenance event, the behavior for VMs configured to Terminate is the same behavior as for C4 bare metal instances.

Lower than expected performance with Z3 bare metal instances on RHEL 8

When using Red Hat Enterprise Linux (RHEL) version 8 with a Z3 bare metal instance, the network performance is lower than expected.

Root cause

There is a missing Page Pool feature in the Linux kernel version (4.18) that is used by RHEL 8.

Workaround

Use a more recent version of RHEL or a different operating system when you are working with Z3 bare metal instances.

Issues related to using Dynamic Network Interfaces

This section describes known issues related to using multiple network interfaces and Dynamic Network Interfaces.

Guest agent version 20250901.00 or later doesn't install Dynamic NICs

If you attempt to Configure automatic management of Dynamic NICs and your instance is running the guest agent at version 20250901.00 or later, the guest agent fails to install and manage Dynamic NICs in the guest OS of your instance.

You might receive an error that includes Cannot find device when running commands in the guest OS that reference Dynamic NICs.

Root cause

Starting with version 20250901.00, the guest agent is migrated to a new plugin-based architecture to improve modularity. The new architecture doesn't support the automatic installation and management of Dynamic NICs.

Workaround

See the following:

  • To resolve this issue for instances that already have Dynamic NICs, follow the instructions in Backward compatibility to revert to the previous guest agent architecture.
  • To prevent this issue from happening, before creating instances with Dynamic NICs or adding Dynamic NICs to existing instances, follow the instructions in Backward compatibility to revert to the previous guest agent architecture.

Packet loss with custom MTU sizes

A Dynamic NIC with a parent vNIC might experience packet loss with custom MTU sizes.

Workaround

To avoid packet loss, use one of the following MTU sizes:

  • 1,460 bytes (the default value for VPC networks)
  • 1,500 bytes (standard Ethernet)
  • 8,896 bytes (the maximum possible value)

Firewall interactions when reusing a VLAN ID with Dynamic NICs

On an instance, reusing a VLAN ID for a new Dynamic NIC has firewall connection tracking implications. If you delete a Dynamic NIC and create a replacement Dynamic NIC that uses the same VLAN ID, firewall connection tracking table entries aren't automatically cleared. The following example shows the relevant security considerations:

  1. A compute instance has an exaxmple Dynamic NIC with VLAN ID 4 connected to a subnet in the network-1 VPC network.
  2. The instance sends packets to the 192.0.2.7:443 destination IP address and destination port. Applicable egress firewall rules allow the outbound connection.
  3. Because Cloud NGFW rules are stateful, the allowed outbound connection creates a firewall connection tracking table entry that permits inbound packets from source IP address and source port 192.0.2.7:443.
  4. You delete the example Dynamic NIC and create a replacement Dynamic NIC. The replacement Dynamic NIC also uses VLAN ID 4 but connects to a subnet in a different VPC network, network-2.
  5. All unexpired firewall connection tracking table entries that were applicable to the original Dynamic NIC are applicable to the replacement Dynamic NIC. This means that a resource in the network-2 VPC network can send packets whose sources match 192.0.2.7:443, and the compute instance accepts them without needing to evaluate ingress firewall rules.

For more information about connection tracking and firewalls rules, see Specifications in the Cloud Next Generation Firewall documentation.

Solution

On a per-instance basis, if you need to create a Dynamic NIC that uses the same VLAN ID as a Dynamic NIC that was removed from the instance:

  • Wait at least 10 minutes between deleting the original Dynamic NIC and creating the replacement Dynamic NIC.
  • Stop the instance, delete the original Dynamic NIC and create the replacement Dynamic NIC, then start the instance.

Packet interception can result in dropped packets due to missing VLAN tags in the Ethernet headers

Packet interception when using Dynamic NIC can result in dropped packets. Dropped packets can happen when the pipeline is terminated early. The issue affects both session-based and non-session-based modes.

Root cause

Dropped packets occur during packet interception when the pipeline is terminated early (ingress intercept and egress reinject). The early termination causes the VLAN ID to be missing from the ingress packet's Ethernet header. Because the egress packet is derived from the modified ingress packet, the egress packet also lacks the VLAN ID. This leads to incorrect endpoint index selection and subsequent packet drops.

Workaround

Don't use Google Cloud features that rely on packet intercept, such as firewall endpoints.

Known issues for Linux VM instances

These are the known issues for Linux VMs.

Debian 11 VMs that use OS image version prior to v20250728 fail to run apt update

On July 22, 2025, the Debian community removed Debian 11 (Bullseye) backports from the main Debian upstream. This update causes sudo apt update to fail with the following error:

The repository 'https://deb.debian.org/debian bullseye-backports Release' does
not have a Release file.

Root cause

Because the Debian community removed the backports repositories from the main upstream, there is no longer any reference to bullseye-backports Release.

Resolution

Use image version debian-11-bullseye-v20250728 or newer. These versions don't contain the backports repositories. Alternatively, you can update current instances by modifying /etc/apt/sources.list:

  • To update the repository URL and use the archive for bullseye-backports:

    sudo sed -i 's/^deb https:\/\/deb.debian.org\/debian bullseye-backports main$/deb https:\/\/archive.debian.org\/debian bullseye-backports main/g; s/^deb-src https:\/\/deb.debian.org\/debian bullseye-backports main$/deb-src https:\/\/archive.debian.org\/debian bullseye-backports main/g' /etc/apt/sources.list
    
  • To delete the repository URL and discard bullseye-backports:

    sudo sed -i '/^deb https:\/\/deb.debian.org\/debian bullseye-backports main$/d; /^deb-src https:\/\/deb.debian.org\/debian bullseye-backports main$/d' /etc/apt/sources.list
    

Installing ubuntu-desktop package breaks VM network upon restarting

After installing ubuntu-desktop package on an Ubuntu VM, the following workaround must be executed before restarting the VM:

echo -e 'network:\n  version: 2\n  renderer: networkd' | sudo tee /etc/netplan/99-gce-renderer.yaml

Otherwise, network interfaces might not be correctly configured upon restarting.

Root cause

The ubuntu-deskop package pulls ubuntu-settings as a dependency, which sets NetworkManager as the default "renderer" for netplan. More specifically, it inserts a new YAML configuration for netplan at /usr/lib/netplan/00-network-manager-all.yaml containing the following:

network:
  version: 2
  renderer: NetworkManager

This configuration conflicts with our networkd-based early provisioning using cloud-init.

Recovery

If the VM has been restarted and is inaccessible, then do the following:

  1. Follow the instructions on rescuing a VM
  2. After mounting the inaccessible VM's Linux file system partition, run the following command (replacing /rescue with your mount point):

    echo -e 'network:\n  version: 2\n  renderer: networkd' | sudo tee /rescue/etc/netplan/99-gce-renderer.yaml
    
  3. Continue with instructions on booting the inaccessible VM back

Ubuntu VMs that use OS Image version v20250530 show incorrect FQDN

You might see an incorrect Fully Qualified Domain Name (FQDN) with the addition of .local suffix when you do one of the following:

  • Update to version 20250328.00 of the google-compute-engine package.
  • Launch instances from any Canonical offered Ubuntu image with the version suffix v20250530. For example, projects/ubuntu-os-cloud/global/images/ubuntu-2204-jammy-v20250530.

If you experience this issue, you might see a FQDN similar to the following:

   [root@ubuntu2204 ~]# apt list --installed | grep google
   ...
   google-compute-engine/noble-updates,now 20250328.00-0ubuntu2~24.04.0 all [installed]
   ...

   [root@ubuntu2204 ~]# curl "http://metadata.google.internal/computeMetadata/v1/instance/image" -H "Metadata-Flavor: Google"
   projects/ubuntu-os-cloud/global/images/ubuntu-2204-jammy-v20250530

   [root@ubuntu2204 ~]# hostname -f
   ubuntu2204.local

Root cause

On all Ubuntu images with version v20250530, the guest-config package version 20250328.00 adds local to the search path due to the introduction of a new configuration file: https://github.com/GoogleCloudPlatform/guest-configs/blob/20250328.00/src/etc/systemd/resolved.conf.d/gce-resolved.conf

   [root@ubuntu2204 ~]# cat /etc/resolv.conf
   # This is /run/systemd/resolve/stub-resolv.conf managed by man:systemd-resolved(8).
   # Do not edit.
   ...

   nameserver 127.0.0.53
   options edns0 trust-ad
   search local ... google.internal

The presence of this local entry within the search path in the /etc/resolv.conf file results in a .local element being appended to the hostname when a FQDN is requested.

Note that version 20250501 of guest-configs already fixes the issue but Canonical hasn't incorporated the fix into their images yet.

Workaround

  1. Modify the Network Name Resolution configuration file /etc/systemd/resolved.conf.d/gce-resolved.conf by changing Domains=local to Domains=~local
  2. Run the following command to restart the systemd-resolved service: systemctl restart systemd-resolved
  3. Check that local is removed from the search path in /etc/resolv.conf
  4. Confirm the FQDN by using the command hostname -f

    [root@ubuntu2204 ~]# hostname -f
    ubuntu2204.us-central1-a.c.my-project.internal
    

Missing mkfs.ext4 on openSUSE Images

Recent v20250724 release of openSUSE images (starting with opensuse-leap-15-6-v20250724-x86-64) from August 2025 is missing the e2fsprogs package, which provides utilities for managing file systems. A common symptom of this issue is that you see an error message such as command not found when you attempt to use the mkfs.ext4 command.

Workaround

If you encounter this issue, install the missing package manually by using the openSUSE package manager, zypper.

# Update the package index
user@opensuse:~> sudo zypper refresh

# Install the e2fsprogs package
user@opensuse:~> sudo zypper install e2fsprogs

# Verify the installation
user@opensuse:~> which mkfs.ext4

SUSE Enterprise VMs fail to boot after changing instances types

After changing a SUSE Linux Enterprise VM's instance type it can fail to boot with the following error seen repeating in the serial console:

            Starting [0;1;39mdracut initqueue hook[0m...
   [  136.146065] dracut-initqueue[377]: Warning: dracut-initqueue: timeout, still waiting for following initqueue hooks:
   [  136.164820] dracut-initqueue[377]: Warning: /lib/dracut/hooks/initqueue/finished/devexists-\x2fdev\x2fdisk\x2fby-uuid\x2fD3E2-0CEB.sh: "[ -e "/dev/disk/by-uuid/D3E2-0CEB" ]"
   [  136.188732] dracut-initqueue[377]: Warning: /lib/dracut/hooks/initqueue/finished/devexists-\x2fdev\x2fdisk\x2fby-uuid\x2fe7b218a9-449d-4477-8200-a7bb61a9ff4d.sh: "if ! grep -q After=remote-fs-pre.target /run/systemd/generator/systemd-cryptsetup@*.service 2>/dev/null; then
   [  136.220738] dracut-initqueue[377]:     [ -e "/dev/disk/by-uuid/e7b218a9-449d-4477-8200-a7bb61a9ff4d" ]
   [  136.240713] dracut-initqueue[377]: fi"

Root cause

SUSE creates its cloud images with a versatile initramfs (initial RAM filesystem) that supports various instance types. This is achieved by using the --no-hostonly --no-hostonly-cmdline -o multipath flags during the initial image creation. However, when a new kernel is installed or the initramfs is regenerated, which happens during system updates, these flags are omitted by default. This results in a smaller initramfs tailored specifically for the current system's hardware, potentially excluding drivers needed for other instance types.

For example, C3 instances use NVMe drives, which require specific modules to be included in the initramfs. If a system with an initramfs lacking these NVMe modules is migrated to a C3 instance, the boot process fails. This issue can also affect other instance types with unique hardware requirements.

Workaround

Before changing the machine type, regenerate the initramfs with all drivers:

dracut --force --no-hostonly

If the system is already impacted by the issue create a temporary rescue VM. Use the chroot command to access the impacted VM's boot disk then regenerate the initramfs using the following command:

dracut -f --no-hostonly

Lower IOPS performance for Local SSD on Z3 with SUSE 12 images

Z3 VMs on SUSE Linux Enterprise Server (SLES) 12 images have significantly less than expected performance for IOPS on Local SSD disks.

Root cause

This is an issue within the SLES 12 codebase.

Workaround

A patch from SUSE to fix this issue is not available or planned. Instead, you should use the SLES 15 operating system.

RHEL 7 and CentOS VMs lose network access after reboot

If your CentOS or RHEL 7 VMs have multiple network interface cards (NICs) and one of these NICs doesn't use the VirtIO interface, then network access might be lost on reboot. This happens because RHEL doesn't support disabling predictable network interface names if at least one NIC doesn't use the VirtIO interface.

Resolution

Network connectivity can be restored by stopping and starting the VM until the issue resolves. Network connectivity loss can be prevented from reoccurring by doing the following:

  1. Edit the /etc/default/grub file and remove the kernel parameters net.ifnames=0 and biosdevname=0.

  2. Regenerate the grub configuration.

  3. Reboot the VM.

repomd.xml signature couldn't be verified

On Red Hat Enterprise Linux (RHEL) or CentOS 7 based systems, you might see the following error when trying to install or update software using yum. This error shows that you have an expired or incorrect repository GPG key.

Sample log:

[root@centos7 ~]# yum update


...

google-cloud-sdk/signature                                                                  | 1.4 kB  00:00:01 !!!
https://packages.cloud.google.com/yum/repos/cloud-sdk-el7-x86_64/repodata/repomd.xml: [Errno -1] repomd.xml signature could not be verified for google-cloud-sdk
Trying other mirror.

...

failure: repodata/repomd.xml from google-cloud-sdk: [Errno 256] No more mirrors to try.
https://packages.cloud.google.com/yum/repos/cloud-sdk-el7-x86_64/repodata/repomd.xml: [Errno -1] repomd.xml signature could not be verified for google-cloud-sdk

Resolution

To fix this, disable repository GPG key checking in the yum repository configuration by setting repo_gpgcheck=0. In supported Compute Engine base images this setting might be found in /etc/yum.repos.d/google-cloud.repo file. However, your VM can have this set in different repository configuration files or automation tools.

Yum repositories don't usually use GPG keys for repository validation. Instead, the https endpoint is trusted.

To locate and update this setting, complete the following steps:

  1. Look for the setting in your /etc/yum.repos.d/google-cloud.repo file.

    cat /etc/yum.repos.d/google-cloud.repo
    
    
    [google-compute-engine]
    name=Google Compute Engine
    baseurl=https://packages.cloud.google.com/yum/repos/google-compute-engine-el7-x86_64-stable
    enabled=1
    gpgcheck=1
    repo_gpgcheck=1
    gpgkey=https://packages.cloud.google.com/yum/doc/yum-key.gpg
       https://packages.cloud.google.com/yum/doc/rpm-package-key.gpg
    [google-cloud-sdk]
    name=Google Cloud SDK
    baseurl=https://packages.cloud.google.com/yum/repos/cloud-sdk-el7-x86_64
    enabled=1
    gpgcheck=1
    repo_gpgcheck=1
    gpgkey=https://packages.cloud.google.com/yum/doc/yum-key.gpg
       https://packages.cloud.google.com/yum/doc/rpm-package-key.gpg
    
    
  2. Change all lines that say repo_gpgcheck=1 to repo_gpgcheck=0.

    sudo sed -i 's/repo_gpgcheck=1/repo_gpgcheck=0/g' /etc/yum.repos.d/google-cloud.repo
  3. Check that the setting is updated.

    cat /etc/yum.repos.d/google-cloud.repo
    
    [google-compute-engine]
    name=Google Compute Engine
    baseurl=https://packages.cloud.google.com/yum/repos/google-compute-engine-el7-x86_64-stable
    enabled=1
    gpgcheck=1
    repo_gpgcheck=0
    gpgkey=https://packages.cloud.google.com/yum/doc/yum-key.gpg
       https://packages.cloud.google.com/yum/doc/rpm-package-key.gpg
    [google-cloud-sdk]
    name=Google Cloud SDK
    baseurl=https://packages.cloud.google.com/yum/repos/cloud-sdk-el7-x86_64
    enabled=1
    gpgcheck=1
    repo_gpgcheck=0
    gpgkey=https://packages.cloud.google.com/yum/doc/yum-key.gpg
       https://packages.cloud.google.com/yum/doc/rpm-package-key.gpg
    

Instances using OS Login return a login message after connection

On some instances that use OS Login, you might receive the following error message after the connection is established:

/usr/bin/id: cannot find name for group ID 123456789

Resolution

Ignore the error message.

Known issues for Windows VM instances

  • Support for NVMe on Windows using the Community NVMe driver is in Beta, and the performance might not match that of Linux instances. The Community NVMe driver has been replaced with the Microsoft StorNVMe driver in Google Cloud public images. We recommend that you replace the NVME driver on VMs created before May 2022 and use the Microsoft StorNVMe driver instead.
  • After you create an instance, you cannot connect to it instantly. All new Windows instances use the System preparation (sysprep) tool to set up your instance, which can take 5–10 mins to complete.
  • Windows Server images cannot activate without a network connection to kms.windows.googlecloud.com and stop functioning if they don't initially authenticate within 30 days. Software activated by the KMS must reactivate every 180 days, but the KMS attempts to reactivate every 7 days. Make sure to configure your Windows instances so that they remain activated.
  • Kernel software that accesses non-emulated model specific registers will generate general protection faults, which can cause a system crash depending on the guest operating system.

Windows Server 2025 and Windows 11 24h2 - Suspend and resume support

Windows Server 2025 and Windows 11 24h2 are unable to resume when suspended. Until this issue is resolved, the suspend and resume feature won't be supported for Windows Server 2025 and Windows 11 24h2.

Errors when measuring NTP time drift using w32tm on Windows VMs

For Windows VMs on Compute Engine running VirtIO NICs, there is a known bug where measuring NTP drift produces errors when using the following command:

w32tm /stripchart /computer:metadata.google.internal

The errors appear similar to the following:

Tracking metadata.google.internal [169.254.169.254:123].
The current time is 11/6/2023 6:52:20 PM.
18:52:20, d:+00.0007693s o:+00.0000285s  [                  *                  ]
18:52:22, error: 0x80072733
18:52:24, d:+00.0003550s o:-00.0000754s  [                  *                  ]
18:52:26, error: 0x80072733
18:52:28, d:+00.0003728s o:-00.0000696s  [                  *                  ]
18:52:30, error: 0x80072733
18:52:32, error: 0x80072733

This bug only impacts Compute Engine VMs with VirtIO NICs. VMs that use gVNIC don't encounter this issue.

To avoid this issue, Google recommends using other NTP drift measuring tools, such as the Meinberg Time Server Monitor.

Inaccessible boot device after updating a VM from Gen 1 or 2 to a Gen 3+ VM

Windows Server binds the boot drive to its initial disk interface type upon first startup. To change an existing VM from an older machine series that uses a SCSI disk interface to a newer machine series that uses an NVMe disk interface, perform a Windows PnP driver sysprep before shutting down the VM. This sysprep only prepares device drivers and verifies that all disk interface types are scanned for the boot drive on the next start.

To update the machine series of a VM, do the following:

From a Powershell prompt as Administrator run:

PS C:\> start rundll32.exe sppnp.dll,Sysprep_Generalize_Pnp -wait
  1. Stop the VM.
  2. Change the VM to the new VM machine type.
  3. Start the VM.

If the new VM doesn't start correctly, change the VM back to the original machine type in order to get your VM running again. It should start successfully. Review the migration requirements to verify that you meet them. Then retry the instructions.

Limited disk count attachment for newer VM machine series

VMs running on Microsoft Windows with the NVMe disk interface, which includes T2A and all third-generation VMs, have a disk attachment limit of 16 disks. This limitation does not apply to fourth-generation VMs (C4, M4). To avoid errors, consolidate your Persistent Disk and Hyperdisk storage to a maximum of 16 disks per VM. Local SSD storage is excluded from this issue.

Replace the NVME driver on VMs created before May 2022

If you want to use NVMe on a VM that uses Microsoft Windows, and the VM was created prior to May 1, 2022, you must update the existing NVMe driver in the Guest OS to use the Microsoft StorNVMe driver.

You must update the NVMe driver on your VM before you change the machine type to a third generation machine series, or before creating a boot disk snapshot that will be used to create new VMs that use a third generation machine series.

Use the following commands to install the StorNVME driver package and remove the community driver, if it's present in the guest OS.

googet update
googet install google-compute-engine-driver-nvme

Lower performance for Local SSD on Microsoft Windows with C3 and C3D VMs

Local SSD performance is limited for C3 and C3D VMs running Microsoft Windows.

Performance improvements are in progress.

Lower performance for Hyperdisk Extreme volumes attached to n2-standard-80 instances running Microsoft Windows

Microsoft Windows instances running on n2-standard-80 machine types can reach at most 80,000 IOPS across all Hyperdisk Extreme volumes that are attached to the instance.

Resolution

To reach up to 160,000 IOPS with N2 instances running Windows, choose one of the following machine types:

  • n2-highmem-80
  • n2-highcpu-80
  • n2-standard-96
  • n2-highmem-96
  • n2-highcpu-96
  • n2-highmem-128
  • n2-standard-128

Poor networking throughput when using gVNIC

Windows Server 2022 and Windows 11 VMs that use gVNIC driver GooGet package version 1.0.0@44 or earlier might experience poor networking throughput when using Google Virtual NIC (gVNIC).

To resolve this issue, update the gVNIC driver GooGet package to version 1.0.0@45 or later by doing the following:

  1. Check which driver version is installed on your VM by running the following command from an administrator Command Prompt or Powershell session:

    googet installed
    

    The output looks similar to the following:

    Installed packages:
      ...
      google-compute-engine-driver-gvnic.x86_64 VERSION_NUMBER
      ...
    
  2. If the google-compute-engine-driver-gvnic.x86_64 driver version is 1.0.0@44 or earlier, update the GooGet package repository by running the following command from an administrator Command Prompt or Powershell session:

    googet update
    

Large C4, C4D, and C3D vCPU machine types don't support Windows OS images

C4 machine types with more than 144 vCPUS and C4D and C3D machine types with more than 180 vCPUs don't support Windows Server 2012 and 2016 OS images. Larger C4, C4D, and C3D machine types that use Windows Server 2012 and 2016 OS images will fail to boot. To work around this issue, select a smaller machine type or use another OS image.

C3D VMs created with 360 vCPUs and Windows OS images will fail to boot. To work around this issue, select a smaller machine type or use another OS image.

C4D VMs created with more than 255 vCPUs and Windows 2025 will fail to boot. To work around this issue, select a smaller machine type or use another OS image.

Generic disk error on Windows Server 2016 and 2012 R2 for M3, C3, C3D, and C4D VMs

The ability to add or resize a Hyperdisk or Persistent Disk for a running M3, C3, C3D, or C4D VM doesn't work as expected on specific Windows guests at this time. Windows Server 2012 R2 and Windows Server 2016, and their corresponding non-server Windows variants, don't respond correctly to the disk attach and disk resize commands.

For example, removing a disk from a running M3 VM disconnects the disk from a Windows Server instance without the Windows operating system recognizing that the disk is gone. Subsequent writes to the disk return a generic error.

Resolution

You must restart the M3, C3, C3D, or C4D VM running on Windows after modifying a Hyperdisk or Persistent Disk for the disk modifications to be recognized by these guests.