This message was deleted.
# rke2
a
This message was deleted.
e
Lastly, if I copy /var/lib/rancher/rke2/agent/server-ca.crt on the agent itself, and add it to the OS trust store it will also allow the agent to start without issue, so it does appear to be TLS/cert related.
a
As far as I know you should only be upgrading by one minor version at a time, https://kubernetes.io/releases/version-skew-policy/#supported-version-skew
e
I just used 1.26.9 as a test, but this occurs even if I start with all new nodes at the 1.31.2 version, and occurs with any version newer than 1.26.x.
As a test I incrementally upgraded from 1.26.9 to 1.27.9. Everything worked as expected. Then I upgraded to 1.28.9 next and now I am back in the same situation with agent nodes unable to join due to certificate errors:
Copy code
Waiting to retrieve agent configuration; server is not ready: CA cert validation failed: Get \"<https://127.0.0.1:6444/cacerts>\"
Any idea what might have been introduced in 1.28.X and newer versions that might be causing this?