This page explains concepts about how external passthrough Network Load Balancers distribute traffic.
Backend selection and connection tracking
Backend selection and connection tracking work together to balance multiple connections across different backends and to route all packets for each connection to the same backend. This is accomplished with a two-part strategy. First, a backend is selected using consistent hashing. Then, this selection is recorded in a connection tracking table.
The following steps describe backend selection and connection tracking.
1. Check for a connection tracking table entry
The load balancer determines whether a load-balanced packet belongs to a new connection or to an existing connection by using the following process:
- TCP packet with the `SYN` flag:
  - If the load balancer's connection tracking mode is `PER_CONNECTION`, continue to the Identify eligible backends step. In `PER_CONNECTION` tracking mode, a TCP packet with the `SYN` flag always represents a new connection, regardless of the configured session affinity.
  - If the load balancer's connection tracking mode is `PER_SESSION` and the session affinity is either `NONE` or `CLIENT_IP_PORT_PROTO`, continue to the Identify eligible backends step. In `PER_SESSION` tracking mode, a TCP packet with the `SYN` flag represents a new connection only when using one of the 5-tuple session affinity options (`NONE` or `CLIENT_IP_PORT_PROTO`).
- Connection tracking not supported: if the configured session affinity doesn't support connection tracking for the packet's protocol, continue to the Identify eligible backends step. For information about which protocols are connection trackable, see the table in the Connection tracking mode section.
- All other packets: the load balancer checks whether the packet matches an existing connection tracking table entry. The granularity of the packet hash used to check for an existing connection tracking table entry depends on the connection tracking mode and session affinity you configured. For more information, see the table in the Connection tracking mode section.
  - If the packet matches a connection tracking table entry, the load balancer sends the packet to the previously selected backend.
  - If the packet doesn't match a connection tracking table entry, continue to the Identify eligible backends step.
For information about how long a connection tracking table entry persists and under what conditions it persists, see the Manage connection tracking table entries step.
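The decision process above can be sketched in Python. This is an illustrative model only, not a Google Cloud API: the function names and packet field names are hypothetical. It shows how tracking mode, session affinity, and protocol together determine whether a packet reuses an existing connection tracking entry or goes through backend selection:

```python
def needs_backend_selection(packet, mode, affinity, table):
    """Return True when the packet must go through the backend selection steps."""
    # A TCP SYN always starts a new connection in PER_CONNECTION mode,
    # and in PER_SESSION mode when a 5-tuple session affinity is configured.
    if packet.get("protocol") == "TCP" and packet.get("syn"):
        if mode == "PER_CONNECTION":
            return True
        if mode == "PER_SESSION" and affinity in ("NONE", "CLIENT_IP_PORT_PROTO"):
            return True
    # If the protocol isn't connection trackable for this session affinity,
    # every packet goes through backend selection.
    if not is_trackable(packet["protocol"], affinity):
        return True
    # Otherwise, reuse the previously selected backend when an entry matches.
    return tracking_key(packet, mode) not in table

def is_trackable(protocol, affinity):
    # TCP is always trackable; UDP, ESP, and GRE need a non-NONE affinity.
    if protocol == "TCP":
        return True
    if protocol in ("UDP", "ESP", "GRE"):
        return affinity != "NONE"
    return False  # e.g. ICMP, ICMPv6

def tracking_key(packet, mode):
    # Simplified: PER_CONNECTION keys on the full 5-tuple when ports exist.
    if mode == "PER_CONNECTION" and "src_port" in packet:
        return (packet["src_ip"], packet["src_port"], packet["protocol"],
                packet["dst_ip"], packet["dst_port"])
    return (packet["src_ip"], packet["dst_ip"], packet["protocol"])
```

The sketch omits the `PER_SESSION` key derivation for non-default affinities, which is covered by the table in the Connection tracking mode section.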
2. Backend selection steps
For a new connection, the load balancer uses consistent hashing to select a backend from among the eligible backends.
The following steps outline the process to select an eligible backend for a new connection and then record that connection in a connection tracking table.
2.1 Identify eligible backends
Eligible backends are the backends that are candidates to receive new connections. The following table defines the set of eligible backends depending on whether you've configured a failover policy and whether weighted load balancing is enabled.
| Failover policy | Weighted load balancing | Eligible backends |
|---|---|---|
| Not configured | Disabled | All configured backends are primary backends. The set of eligible backends is derived from the health states of the primary backends. |
| Not configured | Enabled | The eligible backends are those that come from the first of several candidate sets that isn't empty, based on backend health and reported weights. |
| Configured | Disabled | The load balancer uses health check information and the failover policy configuration to define the set of eligible backends. |
| Configured | Enabled | The load balancer uses weight information, health check information, and the failover policy configuration to define the set of eligible backends. |
2.2 Select an eligible backend
The load balancer maintains hashes of eligible backends, with each backend hash mapped to a unit circle. Weighted load balancing alters how eligible backends are mapped to the circle such that backends with higher weights are more likely to be selected, proportional to their weights.
When processing a packet for a connection that's not in the connection tracking table, the load balancer computes a hash of the packet characteristics and maps that hash to the same unit circle, selecting an eligible backend on the circle's circumference. The set of packet characteristics used to calculate the packet hash is defined by the session affinity setting. For example, when the selected session affinity results in a 2-tuple or 3-tuple backend selection hash, all TCP connections from a source IP address are mapped to the same eligible backend.
- If a session affinity isn't explicitly configured, the `NONE` session affinity is the default.
Consistent hashing ensures the load balancer assigns new connections to eligible backends in a way that minimizes mapping disruptions even if the number of eligible backends or their weights change.
The load balancer always selects the same eligible backend for a connection, and more generally, always selects the same eligible backend for all packets with identical packet characteristics as defined by the session affinity setting, in the following situations:
If weighted load balancing isn't configured, when the set of eligible backends does not change.
If weighted load balancing is configured, when the set of eligible backends does not change, and the weight of each eligible backend remains constant.
If an eligible backend is added, removed, or has its weight changed, consistent hashing aims to minimize the disruption of mappings to the other eligible backends—that is, most connections that map to other eligible backends continue to map to the same eligible backend.
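This minimal-disruption property can be demonstrated with a toy hash ring. The sketch below is illustrative only: MD5 and the replica count are arbitrary choices, not the load balancer's actual (undocumented) hash functions. Removing a backend reassigns only the connections that mapped to it:

```python
import bisect
import hashlib

def _point(key: str) -> int:
    # Position on the "circle" (here, the space of 128-bit MD5 values).
    return int(hashlib.md5(key.encode()).hexdigest(), 16)

def build_ring(backends, replicas=100):
    # Each backend contributes several points to smooth the distribution.
    return sorted((_point(f"{b}:{i}"), b)
                  for b in backends for i in range(replicas))

def select_backend(ring, packet_hash_key):
    # Map the packet hash to the next backend point clockwise on the circle.
    points = [p for p, _ in ring]
    idx = bisect.bisect(points, _point(packet_hash_key)) % len(ring)
    return ring[idx][1]

ring_before = build_ring(["backend-1", "backend-2", "backend-3"])
ring_after = build_ring(["backend-1", "backend-2"])  # backend-3 removed
keys = [f"connection-{i}" for i in range(1000)]

# Count connections that were NOT on backend-3 yet still changed backend.
# With consistent hashing this count is 0: only backend-3's connections move.
moved = sum(1 for k in keys
            if select_backend(ring_before, k) != "backend-3"
            and select_backend(ring_before, k) != select_backend(ring_after, k))
```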
Furthermore, consistent hashing ensures the load balancer distributes new connections among eligible backends as fairly as possible. For all possible packet hashes as defined by the configured session affinity setting (and more specifically, for all possible connections when the session affinity results in a 5-tuple hash for backend selection):
- If weighted load balancing isn't configured, approximately `1/N` possible packet hashes map to each eligible backend, where `N` is the count of eligible backends.
- If weighted load balancing is configured, the ratio of possible packet hashes that map to each eligible backend is approximately the weight of that eligible backend divided by the sum of all eligible backend weights.
The following two examples show how weighted load balancing affects the selection probabilities of each eligible backend:
- If the backend service has two eligible backends (the first with weight `1`, the second with weight `4`), the first eligible backend has a 20% (1÷(1+4)) selection probability, and the second eligible backend has an 80% (4÷(1+4)) selection probability.
- If the backend service has three eligible backends (eligible backend `a` with weight `0`, eligible backend `b` with weight `2`, and eligible backend `c` with weight `6`), backend `a` has a 0% (0÷(0+2+6)) selection probability, backend `b` has a 25% (2÷(0+2+6)) selection probability, and backend `c` has a 75% (6÷(0+2+6)) selection probability.
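The two examples can be verified with a few lines of Python, restating the weight arithmetic directly:

```python
def selection_probabilities(weights):
    # Probability = backend weight / sum of all eligible backend weights.
    total = sum(weights.values())
    return {backend: weight / total for backend, weight in weights.items()}

two = selection_probabilities({"first": 1, "second": 4})
# {'first': 0.2, 'second': 0.8}  -> 20% and 80%
three = selection_probabilities({"a": 0, "b": 2, "c": 6})
# {'a': 0.0, 'b': 0.25, 'c': 0.75}  -> 0%, 25%, and 75%
```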
2.3 Create a connection tracking table entry
After selecting a backend, the load balancer creates a connection tracking table entry if the configured session affinity supports connection tracking for the packet's protocol. If the configured session affinity doesn't support connection tracking for the packet's protocol, skip this step.
If the configured session affinity supports connection tracking for the packet's protocol, the connection tracking table entry that's created maps packet characteristics to the selected backend. The packet header fields used for this mapping depend on the connection tracking mode and session affinity you configured.
For information about which protocols are connection trackable based on your configuration choices, and what packet characteristics are used for the hash in the connection tracking table, see the table in the Connection tracking mode section.
3. Manage connection tracking table entries
The load balancer manages connection tracking table entries according to the following events and rules:
- Idle entries are removed: a connection tracking table entry is removed after the connection has been idle for 60 seconds. For more information, see Idle timeout.
- Closed TCP connections: connection tracking table entries for TCP connections are not removed when a TCP connection is closed with a `FIN` or `RST` packet. They might be removed later as idle entries. Each new TCP connection always carries the `SYN` flag and is subject to the processing described in the Check for a connection tracking table entry step.
- Connection draining on failover: when at least one failover backend is configured and the connection draining on failover setting is disabled, the load balancer removes all entries in the connection tracking table when the set of eligible backends switches between primary and failover backends. For more information, see Connection draining on failover.
- Connection persistence on unhealthy backends: entries in the connection tracking table can be removed if a backend becomes unhealthy. This behavior depends on factors described in Connection persistence on unhealthy backends.
When a connection tracking table entry is removed because a previously selected backend changes from healthy to unhealthy, subsequent packets for the connection are treated as if they belong to a new connection. After selecting a new eligible backend for the subsequent packets, the load balancer creates a replacement connection tracking table entry.
A replacement connection tracking table entry behaves exactly like any other connection tracking table entry, and is subject to the events and rules of this step.
If the previously selected backend returns to healthy from unhealthy, the health check change alone doesn't cause the replacement connection tracking table entry to be removed. An exception happens when at least one failover backend is configured and the connection draining on failover setting is disabled; if the change in health check state of a previously selected backend coincides with the set of eligible backends switching between primary and failover backends, connection tracking table entries are removed.
- Connection draining for removed, stopped, or deleted backends: if connection draining for removed, stopped, or deleted backends is enabled, connection tracking table entries are removed after a configurable connection draining timeout. Counting to the timeout begins when the command to remove, stop, or delete a backend is received. If connection draining for removed, stopped, or deleted backends is disabled, connection tracking table entries are removed when the command to remove, stop, or delete a backend is received. For more information, see Enable connection draining.
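The idle-eviction rule can be sketched as a small table with a 60-second timeout. This is an illustrative model (class and method names are hypothetical; the load balancer's internal table isn't observable directly), showing that an entry's lifetime is measured from the last matching packet:

```python
import time

IDLE_TIMEOUT_S = 60.0

class TrackingTable:
    def __init__(self, now=time.monotonic):
        self._now = now
        self._entries = {}  # key -> (backend, time of last matching packet)

    def record(self, key, backend):
        self._entries[key] = (backend, self._now())

    def lookup(self, key):
        entry = self._entries.get(key)
        if entry is None:
            return None
        backend, last_seen = entry
        if self._now() - last_seen > IDLE_TIMEOUT_S:
            del self._entries[key]  # idle entry removed
            return None
        # A matching packet refreshes the entry's idle timer.
        self._entries[key] = (backend, self._now())
        return backend
```

Injecting a fake clock (the `now` parameter) makes the expiry behavior easy to exercise without waiting 60 real seconds.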
Session affinity
The session affinity setting of an external passthrough Network Load Balancer defines the packet hash for backend selection, and, based on the connection tracking mode, the packet hash for connection tracking.
You configure session affinity on the backend service, not on each backend instance group or NEG. The session affinity determines which IP and Layer 4 headers are used to calculate a hash of packet characteristics. The hash of packet characteristics is used in the Backend selection steps.
External passthrough Network Load Balancers support the following session affinity settings.
| Hash method for backend selection | Session affinity setting |
|---|---|
| 5-tuple hash (source IP address, source port, protocol, destination IP address, and destination port) for non-fragmented packets that include port information, such as TCP packets and non-fragmented UDP packets; OR 3-tuple hash (source IP address, destination IP address, and protocol) for fragmented UDP packets and packets of all other protocols | `NONE`¹ or `CLIENT_IP_PORT_PROTO` |
| 3-tuple hash (source IP address, destination IP address, and protocol) | `CLIENT_IP_PROTO` |
| 2-tuple hash (source IP address and destination IP address) | `CLIENT_IP` |
¹ `NONE` session affinity doesn't indicate that there is no session affinity. Instead, it means that session affinity is done with a 5-tuple hash or a 3-tuple hash of packet characteristics, functionally the same behavior as when `CLIENT_IP_PORT_PROTO` is set.
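The table above can be restated as a small helper that picks the hash fields for a packet. This sketch is illustrative (the field names are hypothetical, not a Google Cloud API):

```python
def hash_fields(affinity, packet):
    """Return the tuple of packet fields hashed for backend selection."""
    has_ports = "src_port" in packet and "dst_port" in packet
    if affinity in ("NONE", "CLIENT_IP_PORT_PROTO") and has_ports:
        # 5-tuple for non-fragmented packets that include port information.
        return (packet["src_ip"], packet["src_port"], packet["protocol"],
                packet["dst_ip"], packet["dst_port"])
    if affinity == "CLIENT_IP":
        # 2-tuple: source and destination IP addresses only.
        return (packet["src_ip"], packet["dst_ip"])
    # 3-tuple: used by CLIENT_IP_PROTO, and as the fallback for NONE and
    # CLIENT_IP_PORT_PROTO when port information is absent.
    return (packet["src_ip"], packet["dst_ip"], packet["protocol"])
```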
Connection tracking policy
This section describes the settings in the load balancer's connection tracking policy.
Connection tracking mode
This section describes when and how the load balancer creates entries in its connection tracking table. External passthrough Network Load Balancers support connection tracking based on protocol and session affinity:

- TCP connections are always connection trackable, for all session affinity options.
- UDP, ESP, and GRE connections are connection trackable for all session affinity options except for `NONE`.
- All other protocols, such as ICMP and ICMPv6, aren't connection trackable.
When connection tracking is possible, the connection tracking mode, protocol, and session affinity determine the set of packet characteristics that are used to make the packet hash in each connection tracking table entry.
The connection tracking mode can be one of the following:
- `PER_CONNECTION`: the default and most granular connection tracking mode. Each connection is defined as either a 5-tuple hash or a 3-tuple hash of packet characteristics, depending on whether port information is present in the packet. Non-fragmented packets that include port information (such as TCP packets and non-fragmented UDP packets) are tracked with 5-tuple hashes. All other packets are tracked with 3-tuple hashes.
- `PER_SESSION`: a less granular connection tracking mode that uses a hash matching the session affinity hash. Depending on the chosen session affinity, the `PER_SESSION` tracking mode can treat multiple distinct connections as a single connection for connection tracking purposes. This reduces the frequency with which a connection is considered new and subject to the Backend selection steps.
The following table summarizes:
- The packet hashes used for backend selection; and
- The packet hashes used for connection tracking, based on the connection tracking mode, protocol, and session affinity.
| Session affinity | Packet hash for backend selection | Packet hash for connection tracking with `PER_CONNECTION` tracking mode (default) | Packet hash for connection tracking with `PER_SESSION` tracking mode |
|---|---|---|---|
| `NONE` (default) | 5-tuple hash for packets that include port information; 3-tuple hash for all other packets | TCP: 5-tuple hash. All other protocols: not connection trackable. | TCP: 5-tuple hash. All other protocols: not connection trackable. |
| `CLIENT_IP_PORT_PROTO` | 5-tuple hash for packets that include port information; 3-tuple hash for all other packets | TCP and non-fragmented UDP: 5-tuple hash. Fragmented UDP, ESP, and GRE: 3-tuple hash. All other protocols: not connection trackable. | TCP and non-fragmented UDP: 5-tuple hash. Fragmented UDP, ESP, and GRE: 3-tuple hash. All other protocols: not connection trackable. |
| `CLIENT_IP_PROTO` | 3-tuple hash | TCP and non-fragmented UDP: 5-tuple hash. Fragmented UDP, ESP, and GRE: 3-tuple hash. All other protocols: not connection trackable. | TCP, UDP, ESP, and GRE: 3-tuple hash. All other protocols: not connection trackable. |
| `CLIENT_IP` | 2-tuple hash | TCP and non-fragmented UDP: 5-tuple hash. Fragmented UDP, ESP, and GRE: 3-tuple hash. All other protocols: not connection trackable. | TCP, UDP, ESP, and GRE: 2-tuple hash. All other protocols: not connection trackable. |
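To illustrate the difference between the two modes, the following sketch (hypothetical field names, `CLIENT_IP` session affinity assumed) shows how `PER_SESSION` tracking can collapse two distinct TCP connections from the same client into one tracked entry, while `PER_CONNECTION` keeps them separate:

```python
def tracking_key(packet, mode):
    if mode == "PER_CONNECTION":
        # Most granular: the full 5-tuple when port information is present.
        return (packet["src_ip"], packet["src_port"], packet["protocol"],
                packet["dst_ip"], packet["dst_port"])
    # PER_SESSION with CLIENT_IP affinity: 2-tuple, matching the affinity hash.
    return (packet["src_ip"], packet["dst_ip"])

conn_a = {"src_ip": "203.0.113.7", "src_port": 40001, "protocol": "TCP",
          "dst_ip": "198.51.100.9", "dst_port": 443}
conn_b = dict(conn_a, src_port=40002)  # a second connection, same client
```

With `PER_CONNECTION`, `conn_a` and `conn_b` get distinct keys; with `PER_SESSION` plus `CLIENT_IP`, both map to the same key, so the second connection isn't treated as new.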
To learn how to change the connection tracking mode, see Configure a connection tracking policy.
Connection persistence on unhealthy backends
Connection persistence on unhealthy backends controls whether existing connections persist on a previously-selected backend VM or endpoint after the backend becomes unhealthy, provided that the backend stays in a load-balanced instance group or NEG.
The following connection persistence options are available:
- `DEFAULT_FOR_PROTOCOL` (default)
- `NEVER_PERSIST`
- `ALWAYS_PERSIST`
The following table summarizes whether connections persist based on unhealthy backends, depending on the connection persistence option, session affinity, connection tracking mode, and protocol.
| Connection persistence on unhealthy backends option | Behavior with `PER_CONNECTION` tracking mode (default) | Behavior with `PER_SESSION` tracking mode |
|---|---|---|
| `DEFAULT_FOR_PROTOCOL` | TCP: connections persist on unhealthy backends (all session affinities). All other protocols: connections don't persist on unhealthy backends. | TCP: connections persist on unhealthy backends if the session affinity is `NONE` or `CLIENT_IP_PORT_PROTO`. All other protocols: connections don't persist on unhealthy backends. |
| `NEVER_PERSIST` | All protocols: connections never persist on unhealthy backends. | All protocols: connections never persist on unhealthy backends. |
| `ALWAYS_PERSIST` | Connections to connection-trackable protocols persist on unhealthy backends. This option should only be used for advanced use cases. | Configuration not possible |
When connection persistence on unhealthy backends applies to traffic, each connection persists as long as a corresponding connection tracking table entry exists. For more information, see the Manage connection tracking table entries step.
To learn how to change connection persistence behavior, see Configure a connection tracking policy.
TCP connection persistence behavior on unhealthy backends
The load balancer uses 5-tuple hash connection tracking for TCP connections in these situations:
- When using the `PER_CONNECTION` tracking mode (all session affinities), or
- When using the `PER_SESSION` tracking mode with the session affinity set to either `NONE` or `CLIENT_IP_PORT_PROTO`.
When the load balancer uses 5-tuple hash connection tracking for TCP connections, keep the following behaviors in mind:
- If the unhealthy backend continues to respond to packets, the connection continues until it is reset or closed (by either the unhealthy backend or the client).
- If the unhealthy backend sends a TCP reset (RST) packet or does not respond to packets, the client might retry with a new connection, letting the load balancer select a different eligible backend. (TCP `SYN` packets are treated as new connections in the Identify eligible backends step.)
Idle timeout
Entries in connection tracking tables expire 60 seconds after the load balancer processes the last packet that matched the entry. This idle timeout value can't be modified.

Connection draining for removed, stopped, or deleted backends
Connection draining provides a configurable minimum amount of time for existing connections to persist in the load balancer's connection tracking table when one of the following happens:
- A virtual machine (VM) instance is removed from a backend instance group (this includes abandoning an instance in a backend managed instance group)
- A VM is stopped or deleted (this includes automatic actions like rolling updates or scaling down a backend managed instance group)
- An endpoint is removed from a backend network endpoint group (NEG)
By default, connection draining when backends are removed, stopped, or deleted is disabled. For more information, see Enabling connection draining.
Weighted load balancing
Weighted load balancing influences which backends are eligible backends in the Backend selection steps. Each backend VM or endpoint reports its weight to the load balancer by using an HTTP health check and a custom response header. To use weighted load balancing, you must configure the following on the load balancer's backend service:
- The locality policy (`localityLbPolicy`) must be set to `WEIGHTED_MAGLEV`.
- The health check must be an HTTP health check that sends a special response header:
  - The response header's field name must be `X-Load-Balancing-Endpoint-Weight`.
  - The response header's field values can range from `0` to `1000`, inclusive.
For more information, see Configure weighted load balancing.
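As a sketch, a backend could serve its health check and report a weight like the following. The header name comes from this page; the server itself is an illustrative stand-in for your application's health endpoint, not a required implementation:

```python
from http.server import BaseHTTPRequestHandler, HTTPServer

BACKEND_WEIGHT = 100  # must be between 0 and 1000, inclusive

class HealthCheckHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        self.send_response(200)  # HTTP 200 passes the health check
        # Report this backend's weight to the load balancer.
        self.send_header("X-Load-Balancing-Endpoint-Weight", str(BACKEND_WEIGHT))
        self.send_header("Content-Length", "2")
        self.end_headers()
        self.wfile.write(b"ok")

    def log_message(self, *args):
        pass  # keep the example quiet

# To serve: HTTPServer(("0.0.0.0", 80), HealthCheckHandler).serve_forever()
```

Setting `BACKEND_WEIGHT = 0` while still returning HTTP 200 lets a backend opt out of new connections without failing its health check.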
Considerations for weighted load balancing
Weighted load balancing is useful in the following scenarios, which allow a backend to continue processing its existing connections:
- Weighted load balancing allows a backend that's processing long-running connections, or connections that involve large amounts of data, to tell the load balancer to reduce the number of new connections it receives.
- Weighted load balancing allows a backend that's overloaded or entering maintenance to remove itself from the eligible backends for new connections. To do this, the backend sends the `X-Load-Balancing-Endpoint-Weight: 0` response header (and can either continue to pass or fail the load balancer health check). This works because backends with nonzero weights (regardless of health check state) are preferred as eligible backends in the Identify eligible backends step.
Keep the following in mind when using weighted load balancing:
- If eligible backends change their weights frequently, weighted load balancing can be detrimental to session affinity. For more information, see the Select an eligible backend step.
- If you use the same instance group or NEG as a backend of two or more load balancer backend services, you can report a unique weight for each backend service by using the following strategy:
  - Use a unique HTTP health check for each backend service. Each health check can use a unique destination port or `request-path` parameter.
  - Configure backend instances or endpoints to respond with the appropriate weight information for each health check.
Failover
Failover lets you influence the set of eligible backends for new connections by classifying each backend instance group or NEG as primary or failover.
By default, when you add an instance group or NEG to a backend service, the member VMs or endpoints are primary backends, and the instance group or NEG is a primary backend group. With failover, you can add a failover backend group (instance group or NEG) whose member VMs or endpoints are failover backends:
- Failover requires a backend service to have at least one primary backend group and at least one failover backend group.
- You can add up to 50 primary backend groups and 50 failover backend groups to a backend service.
With failover, the following factors determine the set of eligible backends:
- The health state of each backend
- The failover ratio that you've configured
- The drop traffic if backends are unhealthy setting
- Whether you are using failover by itself or in conjunction with weighted load balancing
Failover policy
When a backend service has at least one primary backend group and at least one failover backend group, you can adjust the following settings in its failover policy:
- Failover ratio: a number between `0.0` and `1.0`, inclusive.
- Drop traffic if backends are unhealthy: a boolean that determines the load balancer's last-resort behavior. The failover ratio and the drop traffic if backends are unhealthy setting work together with other factors to control the set of eligible backends.
- Connection draining on failover: a boolean that controls whether connections persist on previously-selected backends when the set of eligible backends switches between primary and failover backends.
Failover ratio
The configured failover ratio determines when the set of eligible backends switches between primary and failover backends. The ratio can be a number between `0.0` and `1.0`, inclusive. If you don't specify a failover ratio, Google Cloud uses a default value of `0.0`. It's a best practice to set your failover ratio to a number that works for your use case rather than relying on this default.
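A simplified sketch of the ratio decision follows. It is illustrative only: it ignores weighted load balancing, assumes the drop traffic if backends are unhealthy setting is disabled, and the function name is hypothetical. The idea is that when the fraction of healthy primary backends falls below the failover ratio, new connections go to healthy failover backends:

```python
def eligible_backends(primary, failover, healthy, failover_ratio):
    healthy_primary = [b for b in primary if b in healthy]
    # Stay on primary backends while their healthy fraction meets the ratio.
    if healthy_primary and len(healthy_primary) / len(primary) >= failover_ratio:
        return healthy_primary
    healthy_failover = [b for b in failover if b in healthy]
    # Fall back to healthy failover backends, then to any healthy primaries.
    return healthy_failover or healthy_primary
```

With the default ratio of `0.0`, the sketch fails over only when no primary backend is healthy; with a ratio of `0.8`, losing more than 20% of primary backends already triggers failover.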
Connection draining on failover
Connection draining on failover controls whether an existing connection persists on a previously-selected backend VM or endpoint when the set of eligible backends switches between primary and failover backends.
Connection draining on failover is enabled by default. The following table summarizes whether connections persist, depending on the connection draining on failover option and protocol:
| Connection draining on failover option | Behavior when the set of eligible backends switches between primary and failover backends |
|---|---|
| Enabled (default) | Connections that are connection trackable persist on previously selected backends, even after the set of eligible backends switches between primary and failover backends. For information about which protocols are connection trackable, see the table in the Connection tracking mode section. |
| Disabled | All protocols: connections don't persist when the set of eligible backends switches between primary and failover backends |
Disabling connection draining on failover and failback is useful for the following scenarios:
- Patching backend VMs. Before patching, you can configure the healthy primary backends that have nonzero weights to fail their health checks, or you can set their weights to zero. This makes the healthy failover backends with nonzero weights the eligible backends. By disabling connection draining on failover and failback, the load balancer removes connection tracking table entries, applies the Backend selection steps to subsequent packets, and delivers them to a different eligible backend. The different backend then closes the connection with a TCP reset, so that client VMs can quickly establish a new connection to the load balancer.
- Single backend VM for data consistency. If you need to ensure that the set of eligible backends has no more than one member VM or endpoint, disabling connection draining on failover and failback reduces the possibility of data inconsistencies.
To learn how to disable connection draining on failover and failback, see Disabling connection draining on failover and failback.
Best practices and guidance
You can optimize the external passthrough Network Load Balancer by following these operational guidelines. The following sections provide technical requirements for managing fragmented UDP packets and best practices for testing load distribution from a single client.
Handling UDP fragmentation
Backend service-based external passthrough Network Load Balancers can process both fragmented and unfragmented UDP packets. If your application uses fragmented UDP packets, keep the following in mind:
- UDP packets might become fragmented before reaching a Google Cloud VPC network.
- Google Cloud VPC networks forward UDP fragments as they arrive (without waiting for all fragments to arrive).
- Non-Google Cloud networks and on-premises network equipment might forward UDP fragments as they arrive, delay fragmented UDP packets until all fragments have arrived, or discard fragmented UDP packets. For details, see the documentation for the network provider or network equipment.
If you expect fragmented UDP packets and need to route them to the same backends, use the following forwarding rule and backend service configuration parameters:
- Forwarding rule configuration: use only one `UDP` or `L3_DEFAULT` forwarding rule per load-balanced IP address, and configure the forwarding rule to accept traffic on all ports. This ensures that all fragments arrive at the same forwarding rule. Even though the fragmented packets (other than the first fragment) lack a destination port, configuring the forwarding rule to process traffic for all ports also configures it to receive UDP fragments that have no port information. To configure all ports, either use the Google Cloud CLI to set `--ports=ALL` or use the API to set `allPorts` to `True`.
- Backend service configuration: set the backend service's session affinity to `CLIENT_IP` (2-tuple hash) or `CLIENT_IP_PROTO` (3-tuple hash) so that the same backend is selected for UDP packets that include port information and UDP fragments (other than the first fragment) that lack port information. Set the backend service's connection tracking mode to `PER_SESSION` so that the connection tracking table entries are built by using the same 2-tuple or 3-tuple hashes.
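The reasoning can be sketched briefly: non-first UDP fragments carry no UDP header, so only the 3-tuple is available for them, and a 3-tuple (or 2-tuple) affinity therefore hashes every fragment of a datagram identically. The field names below are illustrative:

```python
def affinity_key(packet, affinity):
    has_ports = "src_port" in packet and "dst_port" in packet
    if affinity == "CLIENT_IP_PORT_PROTO" and has_ports:
        # 5-tuple: only possible when the UDP header (ports) is present.
        return (packet["src_ip"], packet["src_port"], "UDP",
                packet["dst_ip"], packet["dst_port"])
    # 3-tuple: always available, even for non-first fragments.
    return (packet["src_ip"], packet["dst_ip"], "UDP")

first_fragment = {"src_ip": "198.51.100.1", "src_port": 5000,
                  "dst_ip": "203.0.113.2", "dst_port": 53}
later_fragment = {"src_ip": "198.51.100.1", "dst_ip": "203.0.113.2"}  # no UDP header
```

With `CLIENT_IP_PROTO`, both fragments produce the same key and reach the same backend; with a 5-tuple affinity, the first fragment hashes differently from the rest.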
Testing from a single client
When testing an external passthrough Network Load Balancer from a single client, keep the following in mind:
- If the client VM is not a backend of the load balancer: new connections are processed as described in the Backend selection and connection tracking steps. In the Select an eligible backend step, the load balancer creates a hash of packet characteristics according to the session affinity. Because all session affinity options rely on at least the client's IP address, connections from the same client might be distributed to the same eligible backend more frequently than you might expect. Consequently, you can't accurately model the overall distribution of new connections by connecting to an external passthrough Network Load Balancer from a single client.
- If the client VM is also a backend VM of the load balancer: new connections aren't processed by the load balancer at all. Outbound packets with the destination IP address of the load balancer's forwarding rule are routed locally within the guest OS of the client because of a local route for the forwarding rule.
What's next
- To configure an external passthrough Network Load Balancer with a backend service for TCP or UDP traffic only (supporting IPv4 and IPv6 traffic), see Set up an external passthrough Network Load Balancer with a backend service.
- To configure an external passthrough Network Load Balancer for multiple IP protocols (supporting IPv4 and IPv6 traffic), see Set up an external passthrough Network Load Balancer for multiple IP protocols.
- To configure an external passthrough Network Load Balancer with a zonal NEG backend, see Set up an external passthrough Network Load Balancer with zonal NEGs.
- To learn how to transition an external passthrough Network Load Balancer from a target pool backend to a regional backend service, see Transitioning an external passthrough Network Load Balancer from a target pool to a backend service.