Good afternoon folks I have been seeing an issue where on in Rancher Users #rke2

tall-raincoat-70627

05/15/2025, 10:47 PM

Good afternoon folks, I have been seeing an issue where, on install, etcd appears to fail to start and the bootstrapping just loops, attempting to connect to etcd with no success. The image is on the server according to crictl, but no pods are running, and the rke2 logs are not showing anything seemingly useful, even with the debug flag. Is there anything I can do to increase the errors being returned from the initial attempts to start etcd?

creamy-pencil-82913

05/15/2025, 10:52 PM

check the kubelet and containerd logs

creamy-pencil-82913

05/15/2025, 10:52 PM

what kind of resources (cpu/memory) do you have on the node?

tall-raincoat-70627

05/15/2025, 10:53 PM

2 CPUs, 32G of RAM

creamy-pencil-82913

05/15/2025, 10:53 PM

you might also check the etcd logs under /var/log/pods

creamy-pencil-82913

05/15/2025, 10:53 PM

2 cores or 2 sockets?

creamy-pencil-82913

05/15/2025, 10:53 PM

that’s a lot of memory for just 2 cores. You need at least 4 cores for a server.

tall-raincoat-70627

05/15/2025, 10:54 PM

ok, i can bump it up

creamy-pencil-82913

05/15/2025, 10:55 PM

that’s probably not it, but that’s definitely low for a server.

tall-raincoat-70627

05/15/2025, 10:55 PM

It doesn't appear to be getting far enough to create /var/log/pods

creamy-pencil-82913

05/15/2025, 10:56 PM

definitely check containerd and kubelet logs then

tall-raincoat-70627

05/15/2025, 11:00 PM

The only non-info message is:

time="2025-05-15T22:45:15.024459505Z" level=error msg="failed to load cni during init, please check CRI plugin status before setting up network for pods" error="cni config load failed: no network config found in /etc/cni/net.d: cni plugin not initialized: failed to load cni config"
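When a log is mostly info-level noise, it can help to filter it by level rather than read it by eye. The following is a minimal sketch, assuming the log lines use the `key=value` / `key="quoted value"` layout seen in the message above; the function names `parse_logfmt` and `non_info_entries` are made up for illustration and are not part of rke2 or containerd.

```python
import shlex


def parse_logfmt(line):
    """Split a key=value / key="quoted value" log line into a dict.

    Tokens without an "=" (for example a journald prefix) are ignored.
    Returns an empty dict if the quoting in the line is unbalanced.
    """
    try:
        tokens = shlex.split(line)
    except ValueError:
        return {}
    fields = {}
    for token in tokens:
        key, sep, value = token.partition("=")
        if sep:
            fields[key] = value
    return fields


def non_info_entries(lines):
    """Return parsed entries that have a level other than "info"."""
    entries = (parse_logfmt(line) for line in lines)
    return [e for e in entries if e.get("level") not in (None, "info")]
```

Feeding it the lines of a log file (for example `non_info_entries(open(path))`, with `path` pointing at whichever log you are inspecting) returns only the warning and error entries, each as a dict with `time`, `level`, `msg`, and any `error` field.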

creamy-pencil-82913

05/15/2025, 11:00 PM

cni doesn’t get installed until later

creamy-pencil-82913

05/15/2025, 11:00 PM

kubelet and containerd are both running?

tall-raincoat-70627

05/15/2025, 11:01 PM

kubelet exits when it can't contact etcd

creamy-pencil-82913

05/15/2025, 11:01 PM

that’s not how that works

tall-raincoat-70627

05/15/2025, 11:01 PM

and gets restarted and loops

creamy-pencil-82913

05/15/2025, 11:01 PM

kubelet does not talk to etcd, the apiserver does

tall-raincoat-70627

05/15/2025, 11:01 PM

ok, let me try again

creamy-pencil-82913

05/15/2025, 11:01 PM

what is the exact error the kubelet is exiting with

creamy-pencil-82913

05/15/2025, 11:01 PM

kubelet crashlooping would definitely prevent pods from getting started

tall-raincoat-70627

05/15/2025, 11:03 PM

May 15 23:02:48 na-nonprod-dynenvcp-01c.atl01.xx.com rke2[387567]: time="2025-05-15T23:02:48Z" level=info msg="Failed to test etcd connection: failed to get etcd status: rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: dial tcp 127.0.0.1:2379: connect: connection refused\""
May 15 23:02:48 na-nonprod-dynenvcp-01c.atl01.xx.com rke2[387567]: time="2025-05-15T23:02:48Z" level=error msg="Kubelet exited: exit status 1"
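A "connection refused" on 127.0.0.1:2379 generally means nothing is listening on that port, which fits etcd never having started rather than etcd being unhealthy. A small sketch for checking that directly is below; `probe_tcp` is a hypothetical helper name, and the host and port are simply the ones from the log lines above.

```python
import socket


def probe_tcp(host, port, timeout=2.0):
    """Attempt a TCP connection and describe the outcome.

    Returns "open" if something accepted the connection, "refused" if the
    connection was actively refused, "timeout" if no answer arrived in time,
    or "error: ..." for any other socket failure.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except ConnectionRefusedError:
        return "refused"
    except socket.timeout:
        return "timeout"
    except OSError as exc:
        return f"error: {exc}"


# Example: probe_tcp("127.0.0.1", 2379)
```

A result of "refused" points back at whatever is supposed to launch the listener, so the next place to look is the process that starts the pod, not the client reporting the failed dial.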

creamy-pencil-82913

05/15/2025, 11:04 PM

that’s not the kubelet log. look at the kubelet log.

tall-raincoat-70627

05/15/2025, 11:05 PM

If I make a change to config.yaml, will that get picked up between restarts, or do I need to reinstall?

creamy-pencil-82913

05/15/2025, 11:05 PM

just need to restart

tall-raincoat-70627

05/15/2025, 11:06 PM

Yep, my error: the config.yaml was passing it an invalid flag

creamy-pencil-82913

05/15/2025, 11:06 PM

that’d do it

tall-raincoat-70627

05/15/2025, 11:06 PM

thanks for the assistance

🙌 1

tall-raincoat-70627

05/15/2025, 11:08 PM

"command failed" err="failed to set feature gates from initial flags-based config: unrecognized feature gate: EphemeralContainers"

tall-raincoat-70627

05/15/2025, 11:08 PM

would be the culprit
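For anyone hitting the same error, here is a rough sketch of going from the kubelet message to the offending config line. It assumes the gate was set through a `kubelet-arg` entry in config.yaml; the actual config from this thread was not posted, so the sample text in the usage note is hypothetical, as are the helper names.

```python
import re

# Matches the kubelet startup failure quoted above.
GATE_ERROR = re.compile(r"unrecognized feature gate: (\w+)")


def unrecognized_gate(log_line):
    """Return the feature gate name from a kubelet error line, or None."""
    match = GATE_ERROR.search(log_line)
    return match.group(1) if match else None


def find_gate_in_config(config_text, gate):
    """Return (line_number, stripped_line) pairs that mention the gate.

    This is a plain text search, not a YAML parse; commented-out lines
    are skipped.
    """
    hits = []
    for number, line in enumerate(config_text.splitlines(), start=1):
        stripped = line.strip()
        if gate in stripped and not stripped.startswith("#"):
            hits.append((number, stripped))
    return hits
```

With a hypothetical config containing `kubelet-arg:` followed by `- "feature-gates=EphemeralContainers=true"`, `find_gate_in_config` would report the second line. Kubernetes typically removes a feature gate a few releases after the feature reaches GA, so a gate that was valid on an older version can become unrecognized after an upgrade.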
