Investigate performance issues with AI assistance
This document describes how you can use AI assistance in Bigtable to investigate performance anomalies like hotspots or high cluster latency. You can use the AI-assisted troubleshooting tools of Bigtable and Gemini Cloud Assist to monitor system performance.
Before you begin
Ensure that Gemini Cloud Assist is set up for your Google Cloud user account and project.
After you set up Gemini Cloud Assist, you might need to wait about five minutes for the setup to propagate before you can enable AI-assisted troubleshooting in Bigtable.
Required roles and permissions
To get the permissions that you need to observe and troubleshoot using Gemini Cloud Assist, ask your administrator to grant you the following IAM roles on your project:
- View monitoring dashboards and metrics: Monitoring Viewer (roles/monitoring.viewer)
- Read Bigtable data: Bigtable Reader (roles/bigtable.reader)
- View database performance issues: Database Insights Viewer (roles/databaseinsights.viewer)
- Use Gemini Cloud Assist investigations: Gemini Cloud Assist Investigation Owner (roles/geminicloudassist.investigationOwner)
For more information about granting roles, see Manage access to projects, folders, and organizations.
You might also be able to get the required permissions through custom roles or other predefined roles.
For more information about required roles and permissions for using Gemini Cloud Assist investigations, see Troubleshoot issues with Gemini Cloud Assist Investigations.
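For administrators who grant these roles from the command line, the following minimal sketch builds one `gcloud projects add-iam-policy-binding` command per role listed above. It only prints the commands and does not run them. The project ID `my-project`, the member `user:alice@example.com`, and the helper name `grant_commands` are placeholders chosen for illustration, not values from this guide.

```python
import shlex

# Role IDs copied from the list of required roles in this section.
REQUIRED_ROLES = [
    "roles/monitoring.viewer",
    "roles/bigtable.reader",
    "roles/databaseinsights.viewer",
    "roles/geminicloudassist.investigationOwner",
]


def grant_commands(project_id: str, member: str) -> list[str]:
    """Return one gcloud command string per required role, without running it."""
    return [
        shlex.join([
            "gcloud", "projects", "add-iam-policy-binding", project_id,
            f"--member={member}",
            f"--role={role}",
        ])
        for role in REQUIRED_ROLES
    ]


if __name__ == "__main__":
    # Hypothetical project and user; replace both before using the output.
    for command in grant_commands("my-project", "user:alice@example.com"):
        print(command)
```

Printing the commands instead of executing them lets an administrator review each binding before applying it.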
Open Gemini Cloud Assist
To use Gemini Cloud Assist with Bigtable, do the following:
Open the list of Bigtable instances in the Google Cloud console.
The Instances page displays a list of instances.
To open an instance, click the instance name.
To open the Cloud Assist panel, click Open or close Gemini Cloud Assist chat.
In the Cloud Assist panel, enter a prompt that describes the information that you're interested in. For example: "Why is my Bigtable cluster latency so high?"
After you enter the prompt, click Send prompt.
Gemini returns a response to your prompt based on information from the last hour.
Investigate performance issues
When you observe a spike or increase in metrics such as CPU utilization or latency, you can use investigations to identify the root cause. This process uses AI assistance from Gemini Cloud Assist to analyze your cluster's performance data and provide actionable recommendations.
Open the list of Bigtable instances in the Google Cloud console.
The Instances page displays a list of instances.
To open an instance, click the instance name.
In the navigation menu, click System insights.
To troubleshoot your database load, click Explore Investigations.
The Investigation options pane opens.
In the Cluster field, select the cluster for which you want to view the investigation.
From the list of time filters, select the time range for your investigation or provide a custom date and time for your filter.
Click Investigate.
After about two minutes, the Investigation Overview pane opens with the following sections:
- Issue: a description of the issue being investigated, including the investigation start and stop time.
- Findings: a list of observations about the issue. For example, these can include client misconfiguration details, such as an incorrect channel pool size.
- Hypotheses: a list of AI-recommended actions to take to help address the anomalies.
How investigations analyze your database
When you start an investigation, Gemini Cloud Assist analyzes performance data for the time period that you selected. It compares the following metrics for that period against your aggregated baseline data:
- Client latencies
- Server latencies
- CPU utilization
- Disk utilization
- Request count
When Gemini Cloud Assist completes the analysis and detects a significant change, the investigation identifies a possible root cause. To help you remediate the issue, it lists actionable insights in the Hypotheses section of the Investigation details pane.
In some situations, depending on the analysis, there might not be a recommendation.
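To make the idea of comparing a selected time period against a baseline more concrete, here is a minimal sketch. It is not how Gemini Cloud Assist performs its analysis, which this document does not describe. It assumes a simple rule: flag a metric when its mean over the investigation window differs from its baseline mean by more than a chosen fraction. The function names, the sample values, and the 50% threshold are all hypothetical.

```python
from statistics import mean


def relative_change(baseline: list[float], window: list[float]) -> float:
    """Fractional change of the window mean relative to the baseline mean."""
    base = mean(baseline)
    if base == 0:
        raise ValueError("baseline mean is zero; relative change is undefined")
    return (mean(window) - base) / base


def significant_changes(
    baseline: dict[str, list[float]],
    window: dict[str, list[float]],
    threshold: float = 0.5,  # assumed cutoff, not a documented value
) -> dict[str, float]:
    """Return {metric: change} for metrics whose mean moved by more than threshold."""
    flagged = {}
    for metric, base_samples in baseline.items():
        change = relative_change(base_samples, window[metric])
        if abs(change) > threshold:
            flagged[metric] = change
    return flagged


if __name__ == "__main__":
    # Made-up samples for two of the metrics listed above.
    baseline = {"cpu_utilization": [0.4, 0.4, 0.4], "request_count": [100, 100]}
    window = {"cpu_utilization": [0.8, 0.8, 0.8], "request_count": [110, 110]}
    print(significant_changes(baseline, window))
```

With these made-up samples, only CPU utilization exceeds the assumed threshold. A case where no metric is flagged loosely mirrors an investigation that returns no recommendation.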
What's next
- If you want to continue troubleshooting or get more assistance with system performance, see Gemini Cloud Assist.
- Troubleshoot latency.