Hi,
I have a situation where my one downstream cluster has crashed. The cluster is RKE cluster where it had HA configuration.
How the downstream cluster crashed:
• We were upgrading Rancher from 2.6.10 to 2.7.6 (here by going from 2.6.10 to 2.6.13 and then to 2.7.6)
• Our downstream cluster was on 19.x.x when we upgraded to 2.5.13
• Now to move from 2.7.6, we had to upgrade the docker version to be on 20.x.x or 23.x.x.
• We upgraded one of the etcd node and it got connected. (At this point we didn’t checked that etcd memberlist had only 2 nodes)
• And then we upgraded docker on the second node and it failed.
Now, before upgrading the second etcd node, we had a snapshot taken. But we are unable to revert the snapshot.
We did use:
https://www.suse.com/support/kb/doc/?id=000020695 but this didn’t worked.
I am looking for any suggestion on recovering downstream cluster.
Thanks in advance