11/01/2022, 8:31 PM
Not sure this is the right forum but I’m having some issues upgrading rom 2.5.16 to 2.6.x (currently just trying 2.6.1, tried all the way to 2.6.6). My downstream clusters go unavailable with these messages, ideas on how I can further troubleshoot this?
error syncing 'c-57xzh': handler cluster-deploy: ClusterUnavailable 503: ClusterUnavailable 503: cluster not found, requeuing
I have used the Rancher Backup operator so I can reset and restoring to 2.5.16 works fine I just can’t figure out what’s wrong with the upgrade?
I’ve googled the error and gotten anything from recreating expired webhooks (not the issue) to cloud provider bugs, but these use the custom node driver for RKE1
k3s 1.20.15, Rancher 2.5.16 -> 2.6.1, downstream clusters are k8s 1.18.12
Oh so using the Cluster Explorer I can get a slightly further glimpse at
This resource is currently in a transitioning state, but there isn’t a detailed message available.
Okay that helped me find this error in the bundle resources
'rendered manifests contain a resource that already exists. Unable
to continue with install: could not get information about the resource:
<|> "cattle-fleet-system-fleet-agent-role"
is forbidden: User "system:serviceaccount:fleet-system:fleet-agent" cannot
get resource "clusterroles" in API group "<|>" at
the cluster scope: RBAC: <|> "fleet-system-fleet-agent-role"
not found'
Similar looking issue
So probably a fleet related issue
Cross posting