Best practices for GKE

This document provides a consolidated overview of best practices for designing, building, and operating applications on Google Kubernetes Engine (GKE). Following these recommendations helps you optimize for cost, performance, security, and reliability. Each entry links to more detailed documentation for specific topics.

Select your GKE category:

Or, search for a best practice:

Category Best practices Summary
AI and ML workloads Optimize AI/ML workload efficiency Maximize resource efficiency for AI/ML tasks on GKE.
AI and ML workloads Inference workloads Run machine learning inference workloads on GKE.
AI and ML workloads Autoscale LLM inference with GPUs Automatically scale large language model (LLM) inference using GPUs on GKE.
AI and ML workloads Autoscale LLM inference with TPUs Automatically scale large language model (LLM) inference using TPUs on GKE.
AI and ML workloads Optimize LLM inference with GPUs Optimize LLM inference with GPUs on GKE.
AI and ML workloads Batch processing platform Build and operate a batch processing platform on GKE.
Cost optimization Run cost-effective Kubernetes applications Reduce the operational costs of Kubernetes applications on GKE.
Databases Database options Choose and manage database solutions for GKE applications.
Networking Networking Configure and manage network connectivity for GKE.
Operations Upgrading clusters Learn how to perform smooth and reliable GKE cluster upgrades.
Operations CI/CD for GKE Implement continuous integration and delivery pipelines for GKE applications.
Reliability and scalability Scalability Learn principles and techniques for scaling applications on GKE.
Reliability and scalability Plan for scalability Learn strategies for designing scalable GKE environments.
Reliability and scalability Plan large GKE clusters Learn how to architect and manage large-scale GKE clusters.
Reliability and scalability Plan large workloads Deploy and manage resource-intensive applications.
Security Harden your GKE cluster Enhance the GKE security posture of your GKE clusters.
Security Plan RBAC policies Define role-based access control to manage permissions.
Security Enterprise multi-tenancy Run multiple tenants securely on a single GKE cluster.