aloof-london-38236
02/20/2025, 5:21 PM--debug
turned on:
Tunnel server egress proxy dial error: failed to find Session for client <node>
During the startup of the agent service on both nodes I can see logs like so:
time="2025-02-20T16:44:33Z" level=info msg="Starting k3s agent v1.31.5+k3s1 (56ec5dd4)"
time="2025-02-20T16:44:33Z" level=info msg="Updated load balancer k3s-agent-load-balancer default server: 34.x.x.x:6443"
time="2025-02-20T16:44:33Z" level=info msg="Adding server to load balancer k3s-agent-load-balancer: 10.16.43.69:6443"
time="2025-02-20T16:44:33Z" level=info msg="Adding server to load balancer k3s-agent-load-balancer: 10.16.20.47:6443"
time="2025-02-20T16:44:33Z" level=info msg="Adding server to load balancer k3s-agent-load-balancer: 10.16.32.61:6443"
time="2025-02-20T16:44:33Z" level=info msg="Updated load balancer k3s-agent-load-balancer server addresses -> [10.16.43.69:6443 10.16.20.47:6443 10.16.32.61:6443] [default: 34.x.x.x:6443]"
time="2025-02-20T16:44:33Z" level=info msg="Running load balancer k3s-agent-load-balancer 127.0.0.1:6444 -> [10.16.43.69:6443 10.16.20.47:6443 10.16.32.61:6443] [default: 34.x.x.x:6443]"
time="2025-02-20T16:44:43Z" level=info msg="Server 10.16.43.69:6443@UNCHECKED->FAILED from failed dial"
time="2025-02-20T16:44:53Z" level=info msg="Server 10.16.20.47:6443@UNCHECKED->FAILED from failed dial"
time="2025-02-20T16:44:53Z" level=error msg="failed to get CA certs: Get \"<https://127.0.0.1:6444/cacerts>\": context deadline exceeded (Client.Timeout exceeded while awaiting headers)"
time="2025-02-20T16:45:03Z" level=info msg="Server 10.16.32.61:6443@UNCHECKED->FAILED from failed dial"
time="2025-02-20T16:45:08Z" level=info msg="Module overlay was already loaded"
time="2025-02-20T16:45:08Z" level=info msg="Module nf_conntrack was already loaded"
time="2025-02-20T16:45:08Z" level=info msg="Module br_netfilter was already loaded"
time="2025-02-20T16:45:08Z" level=info msg="Module iptable_nat was already loaded"
time="2025-02-20T16:45:08Z" level=info msg="Module iptable_filter was already loaded"
The following addresses 10.16.43.69:6443 10.16.20.47:6443 10.16.32.61:6443
seem to be the control plane pod IPs, and the 34.x.x.x:6443
address is the LB exposing the control plane.
I found a similar issue: https://github.com/k3s-io/k3s/issues/6698 but I don't think the resolution/issue is entirely the same in this case.creamy-pencil-82913
02/20/2025, 5:31 PMaloof-london-38236
02/20/2025, 5:32 PMaloof-london-38236
02/20/2025, 5:32 PMcreamy-pencil-82913
02/20/2025, 5:33 PMcreamy-pencil-82913
02/20/2025, 5:33 PMaloof-london-38236
02/20/2025, 5:34 PMcreamy-pencil-82913
02/20/2025, 5:35 PMaloof-london-38236
02/20/2025, 5:37 PMcreamy-pencil-82913
02/20/2025, 5:54 PMcreamy-pencil-82913
02/20/2025, 5:56 PMaloof-london-38236
02/20/2025, 6:08 PM- server
- --disable-agent
- --disable=coredns,servicelb,traefik
- --tls-san={{tlsSan}}
- --flannel-backend=none
- --egress-selector-mode=cluster
creamy-pencil-82913
02/20/2025, 6:13 PMkubectl logs
and kubectl exec
work? And what the egress-selector is doing?aloof-london-38236
02/20/2025, 6:14 PMcreamy-pencil-82913
02/20/2025, 6:18 PMcreamy-pencil-82913
02/20/2025, 6:20 PMaloof-london-38236
02/20/2025, 6:21 PM