This overview page explains how you can configure internal and external load balancers in Google Distributed Cloud (GDC) air-gapped for both zonal and global network configurations.
Load balancing for GDC ensures efficient traffic distribution across backend workloads, enhancing application availability and performance.
This page is for network administrators within the platform administrator group, or developers within the application operator group, who are responsible for managing network traffic for their organization. For more information, see Audiences for GDC air-gapped documentation.
Load balancing architecture
GDC provides load balancers that enable applications to expose services to one another. Load balancers allocate a stable virtual IP (VIP) address that balances traffic over a set of backend workloads. Load balancers in GDC perform layer four (L4) load balancing, which means they map a set of configured frontend TCP or UDP ports to corresponding backend ports. Load balancers are configured at the project level.
Load balancers are configured for the following workload types:
- Workloads running on VMs.
- Containerized workloads inside the Kubernetes cluster.
There are three ways to configure load balancers in GDC:
- Use the Networking Kubernetes Resource Model (KRM) API. You can use this API to create global or zonal load balancers.
- Use the gdcloud CLI. You can use this API to create global or zonal load balancers.
- Use the Kubernetes Service directly from the Kubernetes cluster. This method only creates zonal load balancers.
Load balancer components
When you use the KRM API or gdcloud CLI to configure the load balancer, you use an L4 passthrough load balancer:
- L4 means the protocol is either TCP or UDP.
- Passthrough means there is no proxy between workload and client.
The load balancer consists of the following configurable components:
- Forwarding rules: specify what traffic is forwarded, and to which backend service. Forwarding rules have the following specifications: - Consists of three tuples, CIDR, port and protocol, for the client to access.
- Supports TCP and UDP protocols.
- Offers internal and external forwarding rules. Clients can access internal forwarding rules from within the Virtual Private Cloud (VPC). Clients can access external forwarding rules from outside the GDC platform or from within by workloads that have EgressNATvalue defined.
- Forwarding rules connect to a backend service. You can point multiple forwarding rules to point to the same backend service.
 
- Backend services: are the load balancing hub that link forwarding rules, health checks and backends together. A backend service references a backend object, that identifies the workloads the load balancer forwards the traffic to. There are limitations on what backends a single backend service can reference: - One zonal backend resource per zone.
- One cluster backend resource per cluster. This can't be mixed with the project backends.
 
- Backends: a zonal object that specifies the endpoints that serve as the backends for the created backend services. Backend resources must be scoped to a zone. Select endpoints using labels. Scope the selector to a project or cluster: - A project backend is a backend that doesn't have the - ClusterNamefield specified. In this case the specified labels apply to all of the workloads in the specific project, in the specific VPC of a zone. The labels are applied to VM and pod workloads across multiple clusters. When a backend service uses a project backend, you can't reference another backend for that zone in that backend service.
- A cluster backend is a backend that has the - ClusterNamefield specified. In this case, the specified labels apply to all the workloads in the named cluster in the specified project. You can specify, at most, one backend per zone per cluster in a single backend service.
 
- Health checks: specify the probes to determine whether a given workload endpoint in the backend is healthy. The unhealthy endpoint is taken out from the load balancer, until it becomes healthy again. Health checks are only applicable to VM workloads. Pod workloads can use the built-in Kubernetes probe mechanism to determine if a specific endpoint is healthy. 
When using the Kubernetes Service directly from the Kubernetes user cluster,
you use the Service object instead of the previously listed components. You can only target workloads in the cluster where the Service object is created.
External and internal load balancing
GDC applications have access to the following networking service types:
- Internal Load Balancer (ILB): lets you expose a service to other clusters within the organization.
- External Load Balancer (ELB): allocates a VIP address from a range that is routable from external workloads and exposes services outside of the GDC organization, such as other organizations inside or outside of the GDC instance. Use session affinity for ELBs to ensure that requests from a client are consistently routed to the same backend.
Global and zonal load balancers
You can create global or zonal load balancers. The scope of global load
balancers span across a GDC universe. Each
GDC universe can consist of multiple
GDC zones organized into regions that are interconnected
and share a control plane. For example, a universe consisting two regions with
three zones each might look like: us-virginia1-a, us-virginia1-b,
us-virginia1-c and eu-ams1-a, eu-ams1-b, eu-ams1-c.
The scope of zonal load balancers is limited to the zones specified at the time of creation. Each zone is an independent disaster domain. A zone manages infrastructure, services, APIs, and tooling that use a local control plane.
For more information on global and zonal resources in a GDC universe, see Multi-zone overview.
You can create global load balancers using the following methods:
- Use the Networking Kubernetes Resource Model (KRM)
API. Use the API
version networking.global.gdc.googto create global resources.
- Use the gdcloud CLI.
Use the --globalflag when using the gdcloud CLI commands to specify a global scope.
You can create zonal load balancers using the following methods:
- Use the Networking Kubernetes Resource Model (KRM)
API. Use the API
version networking.gdc.googto create zonal resources.
- Use the gdcloud CLI.
Use the --zoneflag when using the gdcloud CLI commands to specify which zones to create load balancers for.
- Use the Kubernetes Service directly from the Kubernetes cluster.
Service virtual IP addresses
ILBs allocate VIP addresses that are internal only to the organization. These VIP addresses are not reachable from outside the organization; therefore, you can only use them to expose services to other applications within an organization. These IP addresses may overlap between organizations in the same instance.
On the other hand, ELBs allocate VIP addresses that are externally reachable from outside the organization. For this reason, ELB VIP addresses must be unique among all organizations. Typically, fewer ELB VIP addresses are available for use by the organization.
Limitations
- The - BackendServiceresource must not be configured with a- HealthCheckresource for pod workloads. Note that the- HealthCheckNamein the- BackendServicespecification is optional and must be omitted when configuring a load balancer with pods.
- A load balancer configuration can't target mixed workloads involving pods and VMs. Therefore, mixed backends involving pods and VMs in one - BackendServiceresource is not allowed.
- A global load balancer custom resource ( - ForwardingRuleExternal,- ForwardingRuleInternal,- BackendService, or- HealthCheck) must not have the same name as these zonal load balancer custom resources.- What's next