Fixing Kubelet Socket Exhaustion and TIME-WAIT Issues in Kubernetes
Solution Summary
High-frequency liveness and readiness probes in Kubernetes can cause Kubelet socket exhaustion as TCP connections linger in the TIME-WAIT state. To resolve this, a custom ProbeDialer utilizes the SO_LINGER socket option with a 1-second timeout. This forces immediate kernel reclamation of socket metadata, preventing ephemeral port depletion on high-density nodes.
The Problem
High-frequency liveness and readiness probes open a new TCP connection for every check. On large, high-density clusters this churn can exhaust the Kubelet node's socket resources, so TCP connection handling for probes needs to be tuned deliberately.
Why does this happen?
Kubernetes probes create short-lived TCP connections that leave sockets in a 60-second TIME-WAIT state, eventually depleting ephemeral ports and conntrack entries on high-density nodes, leading to cascading network instability.
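Before changing anything, it helps to confirm that TIME-WAIT buildup is actually the problem. A minimal way to count TIME-WAIT sockets on a Linux node is to parse /proc/net/tcp directly (state code 06 is TIME_WAIT); run this on the Kubelet host and watch whether the count climbs with probe frequency:

```shell
# Count sockets currently in TIME-WAIT (state code 06) on this node.
# /proc/net/tcp6 may be absent on IPv4-only hosts, hence 2>/dev/null.
awk '$4 == "06"' /proc/net/tcp /proc/net/tcp6 2>/dev/null | wc -l
```

A steadily growing count under probe load is the signature of the exhaustion described above.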
Code Example
// Configure SO_LINGER to 1s in your custom ProbeDialer
import (
	"net"
	"syscall"
)

func (d *ProbeDialer) Dial(network, address string) (net.Conn, error) {
	dialer := &net.Dialer{
		Control: func(network, address string, c syscall.RawConn) error {
			var sockErr error
			if err := c.Control(func(fd uintptr) {
				// Set SO_LINGER to 1 second to force prompt socket cleanup
				sockErr = syscall.SetsockoptLinger(int(fd), syscall.SOL_SOCKET,
					syscall.SO_LINGER, &syscall.Linger{Onoff: 1, Linger: 1})
			}); err != nil {
				return err
			}
			return sockErr
		},
	}
	return dialer.Dial(network, address)
}
Step-by-Step Fix
To resolve this, implement a custom ProbeDialer that sets the SO_LINGER socket option on every probe connection. With a linger timeout of 1 second, close() gives the kernel a short, bounded window to finish the teardown instead of parking the socket in the usual 60-second TIME-WAIT state, so socket metadata is reclaimed almost immediately after the probe handshake. Because the linger value is non-zero, the connection still closes with a normal FIN exchange rather than an abortive RST.