Use Prometheus alerts to monitor Spanner Omni

This document describes the Prometheus alerts available for Spanner Omni. Use these alerts to monitor the status and performance of your Spanner Omni deployment.

TrueTime alerts

Use the following alerts to monitor the status of TrueTime in your deployment:

Alert Severity Duration Description
TrueTimeUnavailable critical 1 minute TrueTime is unavailable for more than 1 minute.
ClockSlaViolation critical 1 minute A server has violated the clock service level agreement (SLA).

CPU alerts

Use the following alerts to monitor the CPU utilization of your deployment:

Alert Severity Duration Description
SpannerHighCPUUtilization warning 5 minutes Overall CPU utilization has been higher than 65% for more than 5 minutes.

Storage alerts

Use the following alerts to monitor the storage utilization of your deployment:

Alert Severity Duration Description
SpannerStorageUtilizationWarning warning 5 minutes Spanner Omni storage on a server is high (80%).
SpannerStorageUtilizationCritical critical 5 minutes Spanner Omni storage on a server is high (90%).
SpannerStoragePerVCPUTooHigh warning 5 minutes The storage per vCPU is exceeding 500 GB.

What's next