# general
c
did you look at the etcd docs? that seems like the more logical place to check. https://etcd.io/docs/v3.5/op-guide/maintenance/#defragmentation
Note that defragmentation to a live member blocks the system from reading and writing data while rebuilding its states.
p
I did see that, but couldn’t tell if it would have any impact on running workloads or just on folks using kubectl. The Rancher docs just say ‘run this to fix etcd frag’ without any additional context
c
etcd blocks mean apiserver blocks. if the defrag takes more than about 10 seconds to complete you’ll probably see things start to freak out.
p
Ah so things like health checks etc?
c
I mean things like controllers not being able to renew their leases, and erroring out and crashing
p
😱
r
So defrag blocks access from all nodes, not just one at a time? I’d read the quote as meaning that with a three-node cluster you’d have two functional etcd nodes while the third one blocks.
c
it blocks all access to that etcd server, and each apiserver generally only talks to a single etcd server
I mean realistically you’ll probably be fine, but if you’re on slow disk or have an excessive amount of deleted data to defrag, you should consider doing it during a period of low utilization
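fwiw, a rough sketch of the one-member-at-a-time approach (endpoints are made up here, and you’d add `--cacert`/`--cert`/`--key` flags if your cluster uses TLS client auth). Since defrag only blocks the member it runs against, doing members sequentially keeps the other two serving:

```shell
# Hypothetical member endpoints -- replace with your cluster's.
ENDPOINTS="https://10.0.0.1:2379 https://10.0.0.2:2379 https://10.0.0.3:2379"

# Defragment one member at a time so only that member blocks reads/writes
# while it rebuilds its state; the other members keep serving.
# Printed as a dry run here -- drop the leading "echo" to actually run it.
for ep in $ENDPOINTS; do
  echo etcdctl --endpoints="$ep" defrag
done
```

you can eyeball each member’s DB size first with `etcdctl endpoint status -w table` to see whether a defrag is even worth the window.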
p
Yea, we’ll be trying to find a maintenance window since a lot of our workloads don’t tolerate hiccups well. ty for the info!