This message was deleted.
# rke
a
This message was deleted.
m
Having the same issue
e
Hi, I managed to work this out using this from the release notes of rancher 2.8.0:
Known Issues • Scaling up etcd nodes on RKE may fail, with nodes stuck waiting to register with Kubernetes. This causes the cluster to hang. There are two workarounds available, depending on whether the cluster is active or hanging. See #43356. • Workaround for active clusters: ◦ Add one etcd node, wait for the cluster to become active again, then repeat as needed. • Workaround for hanging clusters: i. Delete the stuck etcd nodes. ii. Find the leader pod:
kubectl -n kube-system get configmap cattle-controller
iii. Restart the leader pod. This terminates the GRPC goroutine. iv. Wait for the stuck nodes to be removed. a. Add one etcd node, wait for the cluster to become active again, then repeat as needed.
And now, adding one etcd node at a time works.