abundant-hair-58573
06/05/2024, 6:23 PM[INFO] Label: <http://cattle.io/os=linux|cattle.io/os=linux>
[INFO] Role requested: worker
[INFO] Using default agent configuration directory /etc/rancher/agent
[INFO] Using default agent var directory /var/lib/rancher/agent
[INFO] Determined CA is not necessary to connect to Rancher
[INFO] Successfully tested Rancher connection
[INFO] Downloading rancher-system-agent binary from <https://rancher-url/assets/rancher-system-agent-amd64>
[INFO] Successfully downloaded the rancher-system-agent binary.
[INFO] Downloading rancher-system-agent-uninstall.sh script from <https://rancher-url/assets/system-agent-uninstall.sh>
[INFO] Successfully downloaded the rancher-system-agent-uninstall.sh script.
[INFO] Generating Cattle ID
curl: (28) Operation timed out after 60001 milliseconds with 0 bytes received
[ERROR] 000 received while downloading Rancher connection information. Sleeping for 5 seconds and trying again
[ERROR] 500 received while downloading Rancher connection information. Sleeping for 5 seconds and trying again
curl: (28) Operation timed out after 60001 milliseconds with 0 bytes received
abundant-hair-58573
06/05/2024, 6:46 PMmessage: waiting for agent to check in and apply initial plan
reason: Waiting
status: Unknown
type: Reconciled
creamy-pencil-82913
06/05/2024, 7:46 PMcreamy-pencil-82913
06/05/2024, 7:47 PMcreamy-pencil-82913
06/05/2024, 7:47 PMabundant-hair-58573
06/05/2024, 7:58 PMabundant-hair-58573
06/05/2024, 8:00 PMabundant-hair-58573
06/05/2024, 8:01 PMabundant-hair-58573
06/05/2024, 8:03 PM+ grep -q '<http://node-role.kubernetes.io/controlplane|node-role.kubernetes.io/controlplane>: "true"' /host/var/lib/rancher/agent/tmp/tmp.wRwguY8TlY/node.yaml
2024-06-05T19:50:58.471695494Z + '[' -z ]
+ grep -q '<http://node-role.kubernetes.io/control-plane|node-role.kubernetes.io/control-plane>: "true"' /host/var/lib/rancher/agent/tmp/tmp.wRwguY8TlY/node.yaml
2024-06-05T19:50:58.476827471Z + '[' -z ]
2024-06-05T19:50:58.476984524Z + grep -q '<http://node-role.kubernetes.io/worker|node-role.kubernetes.io/worker>: "true"' /host/var/lib/rancher/agent/tmp/tmp.wRwguY8TlY/node.yaml
2024-06-05T19:50:58.482114851Z + export 'CATTLE_AGENT_BINARY_LOCAL=true'
2024-06-05T19:50:58.482149972Z + export 'CATTLE_AGENT_UNINSTALL_LOCAL=true'
2024-06-05T19:50:58.482158152Z + export 'CATTLE_AGENT_BINARY_LOCAL_LOCATION=/var/lib/rancher/agent/tmp/tmp.wRwguY8TlY/rancher-system-agent'
+ export 'CATTLE_AGENT_UNINSTALL_LOCAL_LOCATION=/var/lib/rancher/agent/tmp/tmp.wRwguY8TlY/rancher-system-agent-uninstall.sh'
2024-06-05T19:50:58.482170762Z + '[' -s /host/etc/systemd/system/rancher-system-agent.env ]
+ chroot /host /var/lib/rancher/agent/tmp/tmp.wRwguY8TlY/install.sh
[FATAL] You must select at least one role.
2024-06-05T19:50:58.510432123Z + cleanup
+ rm -rf /host/var/lib/rancher/agent/tmp/tmp.wRwguY8TlY
creamy-pencil-82913
06/05/2024, 8:03 PMcreamy-pencil-82913
06/05/2024, 8:03 PM[FATAL] You must select at least one role.
that doesn’t seem rightabundant-hair-58573
06/05/2024, 8:04 PMabundant-hair-58573
06/05/2024, 8:05 PMabundant-hair-58573
06/05/2024, 8:12 PMabundant-hair-58573
06/05/2024, 8:15 PMW0605 19:50:45.333734 1 client_config.go:617] Neither --kubeconfig nor --master was specified. Using the inClusterConfig. This might not work.
2024-06-05T19:50:46.015032960Z time="2024-06-05T19:50:46Z" level=info msg="Applying CRD <http://plans.upgrade.cattle.io|plans.upgrade.cattle.io>"
2024-06-05T19:50:46.750979149Z E0605 19:50:46.750623 1 memcache.go:206] couldn't get resource list for <http://custom.metrics.k8s.io/v1beta1|custom.metrics.k8s.io/v1beta1>: Got empty response for: <http://custom.metrics.k8s.io/v1beta1|custom.metrics.k8s.io/v1beta1>
2024-06-05T19:50:46.753723738Z time="2024-06-05T19:50:46Z" level=info msg="Starting /v1, Kind=Secret controller"
time="2024-06-05T19:50:46Z" level=info msg="Starting /v1, Kind=Node controller"
2024-06-05T19:50:46.785056480Z E0605 19:50:46.784534 1 memcache.go:206] couldn't get resource list for <http://custom.metrics.k8s.io/v1beta1|custom.metrics.k8s.io/v1beta1>: Got empty response for: <http://custom.metrics.k8s.io/v1beta1|custom.metrics.k8s.io/v1beta1>
2024-06-05T19:50:46.790929080Z time="2024-06-05T19:50:46Z" level=info msg="Starting batch/v1, Kind=Job controller"
E0605 19:50:46.826713 1 memcache.go:206] couldn't get resource list for <http://custom.metrics.k8s.io/v1beta1|custom.metrics.k8s.io/v1beta1>: Got empty response for: <http://custom.metrics.k8s.io/v1beta1|custom.metrics.k8s.io/v1beta1>
2024-06-05T19:50:46.832382079Z time="2024-06-05T19:50:46Z" level=info msg="Starting <http://upgrade.cattle.io/v1|upgrade.cattle.io/v1>, Kind=Plan controller"
abundant-hair-58573
06/05/2024, 8:40 PMchroot
2024-06-05T20:34:23.999833095Z + chroot /host /var/lib/rancher/agent/tmp/tmp.6BacBkc7EO/install.sh
2024-06-05T20:34:24.020768795Z [INFO] Using default agent configuration directory /etc/rancher/agent
2024-06-05T20:34:24.020789505Z [INFO] Using default agent var directory /var/lib/rancher/agent
2024-06-05T20:34:24.105857891Z [INFO] Determined CA is not necessary to connect to Rancher
2024-06-05T20:34:24.202566186Z [INFO] Successfully tested Rancher connection
2024-06-05T20:34:24.241500688Z [INFO] Rancher System Agent was detected on this host. Ensuring the rancher-system-agent is stopped.
2024-06-05T20:34:24.289151802Z [INFO] Using local rancher-system-agent binary from /var/lib/rancher/agent/tmp/tmp.6BacBkc7EO/rancher-system-agent
2024-06-05T20:34:24.427582205Z [INFO] Using local rancher-system-agent-uninstall.sh script from /var/lib/rancher/agent/tmp/tmp.6BacBkc7EO/rancher-system-agent-uninstall.sh
2024-06-05T20:34:24.439181254Z [INFO] Generating Cattle ID
2024-06-05T20:34:24.440666763Z [INFO] Cattle ID was already detected as 6d59e8ed240754e19777733c16fd6597be1b769c6b1fcd7ecb7bf4a44368bd8. Not generating a new one.
2024-06-05T20:34:25.141478170Z [INFO] Successfully downloaded Rancher connection information
2024-06-05T20:34:25.141613322Z [INFO] systemd: Creating service file
2024-06-05T20:34:25.146069569Z [INFO] Creating environment file /etc/systemd/system/rancher-system-a
gent.env
2024-06-05T20:34:25.383220384Z [INFO] Enabling rancher-system-agent.service
2024-06-05T20:34:25.574670219Z [INFO] Starting/restarting rancher-system-agent.service
2024-06-05T20:34:25.615165660Z + cleanup
2024-06-05T20:34:25.615189921Z + rm -rf /host/var/lib/rancher/agent/tmp/tmp.6BacBkc7EO
abundant-hair-58573
06/05/2024, 8:59 PM2024/06/05 20:56:07 [ERROR] error syncing 'fleet-default/custom-c653a9a198a3': handler rke-bootstrap: failed to delete fleet-default/custom-c653a9a198a3-machine-bootstrap /v1, Kind=ServiceAccount for rke-bootstrap fleet-default/custom-c653a9a198a3: serviceaccounts "custom-c653a9a198a3-machine-bootstrap" not found, failed to delete fleet-default/custom-c653a9a198a3-machine-plan /v1, Kind=ServiceAccount for rke-bootstrap fleet-default/custom-c653a9a198a3: serviceaccounts "custom-c653a9a198a3-machine-plan" not found, requeuing
abundant-hair-58573
06/05/2024, 9:00 PMabundant-hair-58573
06/05/2024, 10:48 PMcreamy-pencil-82913
06/05/2024, 10:55 PMabundant-hair-58573
06/05/2024, 10:57 PMcreamy-pencil-82913
06/05/2024, 10:59 PMabundant-hair-58573
06/05/2024, 11:00 PMNodenotfound
status from the other issue we've discussed. I have a script that clears those out while checking for healthy nodes with the same name, but I'm reluctant to run that if all of the healthy nodes aren't showing up as a Machine resource, since it will kill those https://github.com/rancher/rancher/issues/45646abundant-hair-58573
06/05/2024, 11:06 PMJun 05 22:52:58 ip-10-114-30-134.domain.org rke2[7804]: time="2024-06-05T22:52:58Z" level=info msg="Adding server to load balancer rke2-api-server-agent-load-balancer: 10.114.38.114:6443"
Jun 05 22:52:58 ip-10-114-30-134.domain.org rke2[7804]: time="2024-06-05T22:52:58Z" level=info msg="Updated load balancer rke2-api-server-agent-load-balancer server addresses -> [10.114.30.239:6
Jun 05 22:52:58 ip-10-114-30-134.domain.org rke2[7804]: time="2024-06-05T22:52:58Z" level=info msg="Adding server to load balancer rke2-agent-load-balancer: 10.114.38.114:9345"
Jun 05 22:52:58 ip-10-114-30-134.domain.org rke2[7804]: time="2024-06-05T22:52:58Z" level=info msg="Updated load balancer rke2-agent-load-balancer server addresses -> [10.114.30.239:9345 10.114.
Jun 05 22:52:58 ip-10-114-30-134.domain.org rke2[7804]: time="2024-06-05T22:52:58Z" level=info msg="Connecting to proxy" url="<wss://10.114.38.114:9345/v1-rke2/connect>"
creamy-pencil-82913
06/05/2024, 11:08 PMabundant-hair-58573
06/05/2024, 11:10 PMabundant-hair-58573
06/05/2024, 11:17 PMtime="2024-06-05T22:45:40Z" level=error msg="Error during subscribe websocket: close sent"
2024-06-05T22:46:00.942449069Z time="2024-06-05T22:46:00Z" level=error msg="Error during subscribe websocket: close sent"
2024-06-05T23:11:57.553310379Z W0605 23:11:57.552896 56 transport.go:301] Unable to cancel request for *client.addQuery
2024-06-05T23:11:57.567532009Z W0605 23:11:57.567164 56 transport.go:301] Unable to cancel request for *client.addQuery
I went ahead and deleted those one at a time, guess I could've just redeployed the cattle-cluster-agent deployment