# k3s
revision numbers always go up. any write to the datastore increases the revision number. that is how etcd works.
you don’t need to compact or defrag by hand. that happens automatically.
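if you want to see the revision behaviour for yourself, here's a rough python sketch (assuming the official `kubernetes` python client, a working kubeconfig, and a throwaway ConfigMap name that's purely for illustration). resourceVersion is opaque by API contract, but in an etcd-backed cluster it tracks the datastore revision, so it goes up after any write:

```python
from kubernetes import client, config

config.load_kube_config()
v1 = client.CoreV1Api()

def current_revision():
    # the list's resourceVersion reflects the current datastore revision
    return v1.list_namespaced_config_map("default").metadata.resource_version

before = current_revision()

# any write bumps the revision, e.g. creating and deleting a throwaway ConfigMap
cm = client.V1ConfigMap(metadata=client.V1ObjectMeta(name="revision-demo"), data={"k": "v"})
v1.create_namespaced_config_map("default", cm)
v1.delete_namespaced_config_map("revision-demo", "default")

after = current_revision()
print(f"revision before: {before}, after: {after}")  # after is strictly greater
```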
hmm, any other ideas why we've been seeing this on the downstream clusters for a few days:
Cluster health check failed: Failed to communicate with API server during namespace check: Get "https://10.43.0.1:443/api/v1/namespaces/kube-system?timeout=45s": context deadline exceeded
or
Cluster health check failed: Failed to communicate with API server during namespace check: Get "https://10.43.0.1:443/api/v1/namespaces/kube-system?timeout=45s": tunnel disconnect
Sometimes the agent disconnects as well
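for reference, the failing check is just a GET on the kube-system namespace with a 45s timeout. a rough python equivalent (not Rancher's actual code, just the same request, assuming the `kubernetes` python client and credentials that can read kube-system) run from inside the downstream cluster tells you whether the API server itself answers, or whether only the tunnel path is broken:

```python
from kubernetes import client, config

config.load_kube_config()  # or config.load_incluster_config() when run from a pod
v1 = client.CoreV1Api()

try:
    ns = v1.read_namespace("kube-system", _request_timeout=45)
    print("API server answered, namespace uid:", ns.metadata.uid)
except Exception as exc:
    # a timeout here mirrors the "context deadline exceeded" part of the error
    print("namespace check failed:", exc)
```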
the Rancher server communicates with the downstream clusters through a tunnel that is maintained by the agents, so if the agents go down then yeah… you’d get errors on the Rancher side.
Figure out why the agents are disconnecting
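a quick way to see whether the agents are actually crashing or just losing the connection is to check their pod status and restart counts in the downstream cluster. rough python sketch, assuming the `kubernetes` python client and the usual cattle-system namespace with the app=cattle-cluster-agent label (adjust if your install differs):

```python
from kubernetes import client, config

config.load_kube_config()
v1 = client.CoreV1Api()

pods = v1.list_namespaced_pod("cattle-system", label_selector="app=cattle-cluster-agent")
for pod in pods.items:
    restarts = sum(cs.restart_count for cs in (pod.status.container_statuses or []))
    print(f"{pod.metadata.name}: phase={pod.status.phase}, restarts={restarts}")
```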
i should see that in the rancher-system-agent.service logs, right?
no, the cattle-cluster-agent pod logs in the downstream cluster.
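something like this pulls the recent agent logs so you can grep for tunnel disconnects (rough sketch, same assumptions about the `kubernetes` python client and the cattle-system namespace / app=cattle-cluster-agent label):

```python
from kubernetes import client, config

config.load_kube_config()
v1 = client.CoreV1Api()

pods = v1.list_namespaced_pod("cattle-system", label_selector="app=cattle-cluster-agent")
for pod in pods.items:
    print(f"--- {pod.metadata.name} ---")
    # tail the last 200 lines and look for tunnel disconnects / connection resets
    print(v1.read_namespaced_pod_log(pod.metadata.name, "cattle-system", tail_lines=200))
```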
hm, not much besides connection reset by peer. i've done a failover of the load balancer in front of rancher, let's see if that helps