future-jordan-75926
11/03/2025, 11:36 AMGet \\"<https://49.13.68.75:10250/containerLogs/airflow/dim-paypal-checked-list-ta5tu5lt/base?follow=true>\\u0026timestamps=true\\": proxy error from 127.0.0.1:9345 while dialing 49.13.68.75:10250, code 502: 502 Bad Gateway","code":500adamant-branch-25874
11/03/2025, 11:48 AMnarrow-guitar-87575
11/03/2025, 11:55 AMadamant-branch-25874
11/03/2025, 11:56 AMfuture-jordan-75926
11/03/2025, 12:09 PMfuture-jordan-75926
11/03/2025, 12:10 PMnarrow-guitar-87575
11/03/2025, 12:11 PMfuture-jordan-75926
11/03/2025, 12:30 PMnarrow-guitar-87575
11/03/2025, 1:57 PMnarrow-guitar-87575
11/03/2025, 1:58 PMfuture-jordan-75926
11/03/2025, 2:08 PMnarrow-guitar-87575
11/03/2025, 2:11 PMfuture-jordan-75926
11/03/2025, 3:27 PMnarrow-guitar-87575
11/03/2025, 3:32 PMcreamy-pencil-82913
11/03/2025, 5:26 PMadamant-branch-25874
11/03/2025, 5:36 PMcreamy-pencil-82913
11/03/2025, 6:04 PMcreamy-pencil-82913
11/03/2025, 6:05 PMkubectl get node -o wide and kubectl get endpoints -n default kubernetes -o yamlcreamy-pencil-82913
11/03/2025, 6:06 PMadamant-branch-25874
11/04/2025, 8:25 AMcreamy-pencil-82913
11/04/2025, 8:42 AMcreamy-pencil-82913
11/04/2025, 8:43 AMfuture-jordan-75926
11/04/2025, 8:52 AMfuture-jordan-75926
11/04/2025, 8:52 AMcreamy-pencil-82913
11/04/2025, 8:57 AMcreamy-pencil-82913
11/04/2025, 8:57 AMcreamy-pencil-82913
11/04/2025, 8:58 AMfuture-jordan-75926
11/04/2025, 9:08 AMk logs airflow-worker-3
Defaulted container "airflow-worker" out of: airflow-worker, dags-git-sync, log-cleanup, dags-git-clone (init), check-db (init), wait-for-db-migrations (init)
Error from server: Get "<https://49.13.68.75:10250/containerLogs/airflow/airflow-worker-3/airflow-worker>": proxy error from 127.0.0.1:9345 while dialing 49.13.68.75:10250, code 502: 502 Bad Gateway
agent
root@worker04:~# journalctl -u rke2-agent.service -f
Nov 02 22:18:59 worker04.blabla rke2[1834650]: time="2025-11-02T22:17:17Z" level=info msg="Server 23.88.46.2:6443@PREFERRED->HEALTHY from successful health check"
Nov 02 22:18:59 worker04.blabla rke2[1834650]: time="2025-11-02T22:18:04Z" level=info msg="Server 128.140.92.225:6443@RECOVERING->PREFERRED from successful health check"
Nov 02 22:18:59 worker04.blabla rke2[1834650]: time="2025-11-02T22:18:58Z" level=info msg="Server 49.13.13.59:6443@FAILED->RECOVERING from successful health check"
Nov 02 22:18:59 worker04.blabla rke2[1834650]: time="2025-11-02T22:18:58Z" level=info msg="Server 128.140.92.225:6443@PREFERRED->FAILED from failed dial"
Nov 02 22:18:59 worker04.blabla rke2[1834650]: time="2025-11-02T22:18:58Z" level=info msg="Server 23.88.46.2:6443@HEALTHY->ACTIVE from successful dial"
Nov 02 22:19:00 worker04.blabla rke2[1834650]: time="2025-11-02T22:18:59Z" level=info msg="Server 49.13.13.59:6443@RECOVERING->PREFERRED from successful health check"
Nov 02 22:19:00 worker04.blabla rke2[1834650]: time="2025-11-02T22:19:00Z" level=info msg="Server 128.140.92.225:6443@FAILED->RECOVERING from successful health check"
Nov 02 22:19:01 worker04.blabla rke2[1834650]: time="2025-11-02T22:19:01Z" level=info msg="Server 128.140.92.225:6443@RECOVERING->PREFERRED from successful health check"
Nov 02 22:20:00 worker04.blabla rke2[1834650]: time="2025-11-02T22:20:00Z" level=info msg="Server 49.13.13.59:6443@PREFERRED->HEALTHY from successful health check"
Nov 02 22:20:01 worker04.blabla rke2[1834650]: time="2025-11-02T22:20:01Z" level=info msg="Server 128.140.92.225:6443@PREFERRED->HEALTHY from successful health check"
server
Nov 04 09:07:54 master01.blabla rke2[788072]: time="2025-11-04T09:07:54Z" level=error msg="Sending HTTP/1.1 502 response to 127.0.0.1:57440: failed to find Session for client worker04.blabla"creamy-pencil-82913
11/04/2025, 9:14 AMcreamy-pencil-82913
11/04/2025, 9:16 AMfuture-jordan-75926
11/04/2025, 9:18 AMjournalctl -u rke2-agent.service -f | grep "49.13.13.59"
Nov 02 22:18:59 worker04.blabla rke2[1834650]: time="2025-11-02T22:18:58Z" level=info msg="Server 49.13.13.59:6443@FAILED->RECOVERING from successful health check"
Nov 02 22:19:00 worker04.blabla rke2[1834650]: time="2025-11-02T22:18:59Z" level=info msg="Server 49.13.13.59:6443@RECOVERING->PREFERRED from successful health check"
Nov 02 22:20:00 worker04.blabla rke2[1834650]: time="2025-11-02T22:20:00Z" level=info msg="Server 49.13.13.59:6443@PREFERRED->HEALTHY from successful health check"creamy-pencil-82913
11/04/2025, 9:22 AMcreamy-pencil-82913
11/04/2025, 9:23 AMINFO[0000] Updated load balancer rke2-agent-load-balancer default server: 172.17.0.4:9345
INFO[0000] Running load balancer rke2-agent-load-balancer 127.0.0.1:6444 -> [] [default: 172.17.0.4:9345]
INFO[0000] Updated load balancer rke2-api-server-agent-load-balancer default server: 172.17.0.4:6443
INFO[0000] Running load balancer rke2-api-server-agent-load-balancer 127.0.0.1:6443 -> [] [default: 172.17.0.4:6443]
INFO[0010] Got apiserver addresses from supervisor: [172.17.0.4:6443]
INFO[0010] Server 172.17.0.4:6443@STANDBY*->UNCHECKED from add to load balancer rke2-api-server-agent-load-balancer
INFO[0010] Updated load balancer rke2-api-server-agent-load-balancer server addresses -> [172.17.0.4:6443] [default: 172.17.0.4:6443]
INFO[0010] Server 172.17.0.4:9345@STANDBY*->UNCHECKED from add to load balancer rke2-agent-load-balancer
INFO[0010] Updated load balancer rke2-agent-load-balancer server addresses -> [172.17.0.4:9345] [default: 172.17.0.4:9345]
INFO[0010] Connecting to proxy url="<wss://172.17.0.4:9345/v1-rke2/connect>"
INFO[0010] Server 172.17.0.4:9345@UNCHECKED*->RECOVERING from successful dial
INFO[0010] Remotedialer connected to proxy url="<wss://172.17.0.4:9345/v1-rke2/connect>"
INFO[0010] Server 172.17.0.4:6443@UNCHECKED*->RECOVERING from successful health check
INFO[0011] Server 172.17.0.4:9345@RECOVERING*->ACTIVE from successful health check
INFO[0011] Server 172.17.0.4:6443@RECOVERING*->ACTIVE from successful health checkcreamy-pencil-82913
11/04/2025, 9:24 AMfuture-jordan-75926
11/04/2025, 9:28 AMfuture-jordan-75926
11/04/2025, 9:28 AMroot@worker04:~# journalctl -u rke2-agent.service | grep "master01"
root@worker04:~#future-jordan-75926
11/04/2025, 9:29 AM1104 08:04:34.228543 1 timeout.go:140] "Post-timeout activity" logger="UnhandledError" timeElapsed="152.977”s" method="GET" path="/api/v1/namespaces/airbyte/pods/replication-job-230486-attempt-0" result=null
I1104 08:09:49.334529 1 cidrallocator.go:277] updated ClusterIP allocator for Service CIDR 10.43.0.0/16
E1104 08:12:48.656379 1 wrap.go:53] "Timeout or abort while handling" logger="UnhandledError" method="GET" URI="/api/v1/namespaces/airbyte/pods/replication-job-230484-attempt-0" auditID="92cf135f-a399-4ad4-ab98-2b357b121f28"
E1104 08:12:48.656452 1 timeout.go:140] "Post-timeout activity" logger="UnhandledError" timeElapsed="5.48”s" method="GET" path="/api/v1/namespaces/airbyte/pods/replication-job-230484-attempt-0" result=null
I1104 08:19:49.334658 1 cidrallocator.go:277] updated ClusterIP allocator for Service CIDR 10.43.0.0/16
I1104 08:29:49.335030 1 cidrallocator.go:277] updated ClusterIP allocator for Service CIDR 10.43.0.0/16
I1104 08:39:49.336135 1 cidrallocator.go:277] updated ClusterIP allocator for Service CIDR 10.43.0.0/16
I1104 08:49:49.336535 1 cidrallocator.go:277] updated ClusterIP allocator for Service CIDR 10.43.0.0/16
I1104 08:59:49.336615 1 cidrallocator.go:277] updated ClusterIP allocator for Service CIDR 10.43.0.0/16
E1104 09:06:57.924497 1 status.go:71] "Unhandled Error" err="apiserver received an error that is not an metav1.Status: &url.Error{Op:\"Get\", URL:\"<https://49.13.68.75:10250/containerLogs/airflow/airflow-worker-3/airflow-worker>\", Err:(*errors.errorString)(0xc05cb532a0)}: Get \"<https://49.13.68.75:10250/containerLogs/airflow/airflow-worker-3/airflow-worker>\": proxy error from 127.0.0.1:9345 while dialing 49.13.68.75:10250, code 502: 502 Bad Gateway" logger="UnhandledError"
E1104 09:07:16.288482 1 status.go:71] "Unhandled Error" err="apiserver received an error that is not an metav1.Status: &url.Error{Op:\"Get\", URL:\"<https://49.13.68.75:10250/containerLogs/airflow/airflow-worker-3/airflow-worker>\", Err:(*errors.errorString)(0xc04b4c2c80)}: Get \"<https://49.13.68.75:10250/containerLogs/airflow/airflow-worker-3/airflow-worker>\": proxy error from 127.0.0.1:9345 while dialing 49.13.68.75:10250, code 502: 502 Bad Gateway" logger="UnhandledError"
E1104 09:07:54.410252 1 status.go:71] "Unhandled Error" err="apiserver received an error that is not an metav1.Status: &url.Error{Op:\"Get\", URL:\"<https://49.13.68.75:10250/containerLogs/airflow/airflow-worker-3/airflow-worker>\", Err:(*errors.errorString)(0xc054191f60)}: Get \"<https://49.13.68.75:10250/containerLogs/airflow/airflow-worker-3/airflow-worker>\": proxy error from 127.0.0.1:9345 while dialing 49.13.68.75:10250, code 502: 502 Bad Gateway" logger="UnhandledError"
I1104 09:09:49.336711 1 cidrallocator.go:277] updated ClusterIP allocator for Service CIDR 10.43.0.0/16
I1104 09:19:49.337428 1 cidrallocator.go:277] updated ClusterIP allocator for Service CIDR 10.43.0.0/16future-jordan-75926
11/04/2025, 9:29 AMcreamy-pencil-82913
11/04/2025, 9:30 AMNov 02 22:18:59 worker04.blabla rke2[1834650]: time="2025-11-02T22:18:58Z" level=info msg="Server 128.140.92.225:6443@PREFERRED->FAILED from failed dial"creamy-pencil-82913
11/04/2025, 9:31 AMcreamy-pencil-82913
11/04/2025, 9:32 AMfuture-jordan-75926
11/04/2025, 9:35 AMfuture-jordan-75926
11/04/2025, 9:37 AMNov 02 22:18:59 worker04.blabla rke2[1834650]: time="2025-11-02T22:18:58Z" level=info msg="Server 128.140.92.225:6443@PREFERRED->FAILED from failed dial"
there is no logs for this on server nodefuture-jordan-75926
11/04/2025, 9:38 AMcreamy-pencil-82913
11/04/2025, 9:38 AMINFO[0137] Handling backend connection request [<http://rke2-agent-001.example.com|rke2-agent-001.example.com>]
and then another message like this when it disconnects (unfortunately it doesnât say who is disconnecting)
INFO[1034] error in remotedialer server [400]: websocket: close 1006 (abnormal closure): unexpected EOFcreamy-pencil-82913
11/04/2025, 9:39 AMfuture-jordan-75926
11/04/2025, 9:43 AMNov 02 02:14:02 master02.blabla rke2[691565]: time="2025-11-02T02:14:02Z" level=info msg="error in remotedialer server [400]: read tcp 128.140.92.225:9345->128.140.14.189:3028: i/o timeout"
Nov 03 14:36:37 master02.blabla rke2[691565]: time="2025-11-03T14:36:37Z" level=info msg="error in remotedialer server [400]: read tcp 128.140.92.225:9345->162.55.100.234:3204: i/o timeout"
Nov 03 14:44:55 master02.blabla rke2[691565]: time="2025-11-03T14:44:55Z" level=info msg="error in remotedialer server [400]: read tcp 128.140.92.225:9345->162.55.100.234:32768: i/o timeout"
Nov 02 02:14:04 master03.blabla rke2[689639]: time="2025-11-02T02:14:04Z" level=info msg="error in remotedialer server [400]: read tcp 23.88.46.2:9345->128.140.14.189:59666: i/o timeout"
Nov 03 14:36:23 master03.blabla rke2[689639]: time="2025-11-03T14:36:23Z" level=info msg="error in remotedialer server [400]: read tcp 23.88.46.2:9345->162.55.100.234:11820: i/o timeout"
Nov 03 14:48:56 master03.blabla rke2[689639]: time="2025-11-03T14:48:56Z" level=info msg="error in remotedialer server [400]: read tcp 23.88.46.2:9345->162.55.100.234:2608: i/o timeout"
Nov 02 02:13:55 master01.blabla rke2[788072]: time="2025-11-02T02:13:55Z" level=info msg="error in remotedialer server [400]: read tcp 49.13.13.59:9345->128.140.14.189:36168: i/o timeout"
Nov 03 14:36:35 master01.blabla rke2[788072]: time="2025-11-03T14:36:35Z" level=info msg="error in remotedialer server [400]: read tcp 49.13.13.59:9345->162.55.100.234:57974: i/o timeout"
Nov 03 14:46:41 master01.blabla rke2[788072]: time="2025-11-03T14:46:41Z" level=info msg="error in remotedialer server [400]: read tcp 49.13.13.59:9345->162.55.100.234:28184: i/o timeout"future-jordan-75926
11/04/2025, 9:44 AMfuture-jordan-75926
11/04/2025, 9:44 AMcreamy-pencil-82913
11/04/2025, 9:45 AMfuture-jordan-75926
11/04/2025, 9:46 AMfuture-jordan-75926
11/04/2025, 9:46 AMcreamy-pencil-82913
11/04/2025, 9:47 AMsupervisor-metrics: true on the server, you can check the loadbalancer health metrics on individual nodes:
kubectl get --server <https://AGENT:9345> --raw /metrics | grep rke2_loadbalancercreamy-pencil-82913
11/04/2025, 9:48 AMfuture-jordan-75926
11/04/2025, 9:53 AMcreamy-pencil-82913
11/04/2025, 9:55 AMcreamy-pencil-82913
11/04/2025, 9:58 AMfuture-jordan-75926
11/04/2025, 10:04 AMfuture-jordan-75926
11/04/2025, 10:04 AMfuture-jordan-75926
11/04/2025, 10:05 AMcreamy-pencil-82913
11/04/2025, 10:07 AMfuture-jordan-75926
11/04/2025, 10:08 AMnarrow-guitar-87575
11/04/2025, 10:30 AMfuture-jordan-75926
11/04/2025, 11:10 AMfuture-jordan-75926
11/05/2025, 10:42 AMNov 04 13:10:55 worker06.blabla rke2[3162564]: time="2025-11-04T13:10:55Z" level=info msg="Tunnel authorizer set Kubelet Port 0.0.0.0:10250"
Nov 04 13:11:46 worker06.blabla rke2[3162564]: time="2025-11-04T13:11:46Z" level=info msg="Server 49.13.13.59:9345@PREFERRED->HEALTHY from successful health check"
Nov 04 13:11:46 worker06.blabla rke2[3162564]: time="2025-11-04T13:11:46Z" level=info msg="Server 128.140.92.225:9345@PREFERRED->HEALTHY from successful health check"
Nov 04 13:11:46 worker06.blabla rke2[3162564]: time="2025-11-04T13:11:46Z" level=info msg="Server 23.88.46.2:6443@PREFERRED->HEALTHY from successful health check"
Nov 04 13:11:46 worker06.blabla rke2[3162564]: time="2025-11-04T13:11:46Z" level=info msg="Server 128.140.92.225:6443@PREFERRED->HEALTHY from successful health check"
Nov 05 06:04:21 worker06.blabla rke2[3162564]: time="2025-11-05T06:04:21Z" level=error msg="Error writing ping" error="write tcp 91.107.197.193:10670->23.88.46.2:9345: i/o timeout"
Nov 05 06:04:21 worker06.blabla rke2[3162564]: time="2025-11-05T06:04:21Z" level=error msg="Error writing ping" error="write tcp 91.107.197.193:10670->23.88.46.2:9345: i/o timeout"
managed to replicate issue on v1.34.1creamy-pencil-82913
11/05/2025, 11:06 AMnarrow-guitar-87575
11/05/2025, 11:14 AMfuture-jordan-75926
11/05/2025, 11:15 AMfuture-jordan-75926
11/05/2025, 11:15 AMnarrow-guitar-87575
11/05/2025, 11:16 AMdmesg outputfuture-jordan-75926
11/05/2025, 11:27 AMfuture-jordan-75926
11/05/2025, 11:27 AMfuture-jordan-75926
11/05/2025, 12:15 PMcreamy-pencil-82913
11/05/2025, 5:16 PMfuture-jordan-75926
11/06/2025, 7:29 AMfuture-jordan-75926
11/06/2025, 7:30 AMcreamy-pencil-82913
11/06/2025, 7:32 AMfuture-jordan-75926
11/06/2025, 8:18 AMadamant-branch-25874
11/06/2025, 10:25 AMfuture-jordan-75926
11/06/2025, 10:46 AMadamant-branch-25874
11/06/2025, 10:50 AMfuture-jordan-75926
11/06/2025, 11:00 AMadamant-branch-25874
11/06/2025, 11:23 AMfuture-jordan-75926
11/06/2025, 11:56 AMadamant-branch-25874
11/06/2025, 3:07 PMcreamy-pencil-82913
11/06/2025, 7:51 PMadamant-branch-25874
11/07/2025, 8:13 AM