This message was deleted Rancher Users #rke2

Join Slack

This message was deleted.

# rke2

adamant-kite-43734

07/08/2022, 8:23 PM

This message was deleted.

creamy-pencil-82913

07/08/2022, 8:30 PM

unhealthy how? I usually just look at the service logs in journald, and the static pod logs in /var/log/pods. Everything you need will be in one of those two places.

creamy-pencil-82913

07/08/2022, 8:33 PM

oh, and the containerd log file too, where that exists

faint-airport-83518

07/08/2022, 9:02 PM

I’m currently deploying in an environment with spotty egress, so sometimes when I spin up a new control plane node the etcd connection might timeout (according to the journalctl output), for example, just not sure where to start to debug that

creamy-pencil-82913

07/08/2022, 9:11 PM

by egress, do you mean your connection to the internet?

creamy-pencil-82913

07/08/2022, 9:11 PM

the rke2-server logs will show timeouts connecting to etcd during initial startup until the etcd static pod starts up. If you have a spotty internet connection, it could be waiting for the etcd image to pull?

creamy-pencil-82913

07/08/2022, 9:12 PM

Have you tried dropping the airgap image tarballs on the nodes to make sure that all the images are available locally? or using a local registry mirror?

creamy-pencil-82913

07/08/2022, 9:12 PM

the containerd logs and

crictl ps

output will show you what’s going on with that, if it is indeed waiting for the etcd image.

faint-airport-83518

07/09/2022, 4:35 PM

Thanks for the info, I'll try checking out the containerd and crictl stuff next time.

3 Views

Open in Slack

Previous Next