#general

happy-branch-33441

02/28/2023, 6:33 PM
What might cause a node to be NotReady after coming up again? For example, if we take a snapshot of a k3s node and restore it, we see the node come up in a healthy running state, then immediately go NotReady for quite a while, and only after many minutes come back to Ready. Is there any way to debug why that might be, aside from inspecting describe output?

creamy-pencil-82913

02/28/2023, 6:35 PM
I am assuming you’re talking about a single-node cluster, and not snapshotting random workers in a larger cluster?

happy-branch-33441

02/28/2023, 6:37 PM
yeah that’s correct

creamy-pencil-82913

02/28/2023, 6:43 PM
well the node is Ready when you snapshot it, right?
When you bring it back up, the node is still in the datastore as Ready

happy-branch-33441

02/28/2023, 6:44 PM
right

creamy-pencil-82913

02/28/2023, 6:44 PM
It won’t get moved to NotReady until the controller manager comes up and notices that the kubelet isn’t reporting in
so it’ll go to NotReady as the kubelet initializes, and then back to Ready
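(A quick way to watch that transition, as a sketch; the node name is a placeholder, and the exact timing depends on the controller manager's node-monitor-grace-period:)

    # Watch the node object flip Ready -> NotReady -> Ready after the restore
    kubectl get node <node-name> -w

    # Print just the Ready condition's status and reason
    kubectl get node <node-name> \
      -o jsonpath='{.status.conditions[?(@.type=="Ready")].status} {.status.conditions[?(@.type=="Ready")].reason}{"\n"}'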

happy-branch-33441

02/28/2023, 6:44 PM
I’m not surprised that it goes into a NotReady state intermittently, but once it does it takes like 5-15 minutes to go back into a Ready state

creamy-pencil-82913

02/28/2023, 6:46 PM
You could probably check the kubelet logs or just describe it to see what’s blocking readiness
I would probably recommend against snapshotting the whole node though, in favor of using etcd and taking etcd snapshots
unless for some reason you have other stuff on there that you’re trying to capture in the snapshot, and you’re OK with a potentially inconsistent datastore state
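(A minimal sketch of both suggestions, assuming a systemd-based k3s install; the node name and snapshot filename are placeholders, and the etcd-snapshot commands only apply if the server was started with the embedded etcd datastore, e.g. --cluster-init, rather than the default SQLite:)

    # See which condition is blocking readiness
    kubectl describe node <node-name>

    # On k3s the kubelet runs inside the k3s process, so its logs are in the service journal
    journalctl -u k3s -f

    # Take an on-demand snapshot of the embedded etcd datastore
    k3s etcd-snapshot save

    # Restore from a snapshot (resets the cluster to the snapshot's state)
    k3s server --cluster-reset \
      --cluster-reset-restore-path=/var/lib/rancher/k3s/server/db/snapshots/<snapshot-file>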

happy-branch-33441

02/28/2023, 6:58 PM
makes sense; and yeah, I totally understand etcd snapshots are preferable, they're just not 100% feasible in our slightly odd use case (effectively using k3s as a replacement for a single-node docker-compose type thing)