# rke2
r
```
level=error msg="spc: failed to wait for node deletion: timed out waiting for the condition"
```
I stopped the service and noticed this line. It fails to restart due to:
```
level=fatal msg="preparing server: failed to bootstrap cluster data: bootstrap data already found and encrypted with different token"
```
Is the 'wait for node deletion' related?
c
no
the fatal error is your problem
you are apparently not setting the correct cluster join token in your config or cli
or more likely you’re not setting one at all.
r
the latter, we don't hard-code a token
c
well you need to if you want to join the cluster
if you don’t set one, one is generated for you and written to the token file on the server when the first server in the cluster is starting up… but if you’re joining a new server to an existing cluster this file obviously won’t exist yet so you’ll need to put it in the config.
this is noted in the docs and at the top of the release notes for every release
> If your server (control-plane) nodes were not started with the `--token` CLI flag or config file key, a randomized token was generated during initial cluster startup. This key is used both for joining new nodes to the cluster, and for encrypting cluster bootstrap data within the datastore. Ensure that you retain a copy of this token, as it is required when restoring from backup.
>
> You may retrieve the token value from any server already joined to the cluster:
```
cat /var/lib/rancher/rke2/server/token
```
r
Right, not joining a new server though. It's a single node instance. I stopped the service, noticed the bad exit. Restarting it resulted in the mismatch error.
c
are you using an external db?
r
kine/sqlite
c
what version of rke2?
Did something happen that wiped out the contents of that token file?
r
1.33.1
For reasons I can't remember, we modify our service to remove `/data/rancher/rke2/server/cred/passwd` on startup, but never the token.
(we use /data instead of /var/lib)
For some further context, this was a 'stress test' situation where we filled the disk to nearly full, then restarted our cluster. We also have some custom kubelet args around evictions:
```
- eviction-hard=imagefs.available<1%,nodefs.available<1%
- eviction-minimum-reclaim=imagefs.available=500Mi,nodefs.available=500Mi
- image-gc-high-threshold=100
```
c
Something happened to make the contents of the token file no longer match the token previously used. Did the file perhaps get truncated on startup because the disk was full?
r
stat-ing the token file, it hasn't changed since before we filled the disk, so it does not appear so
c
On startup the server tries to read the token from the token file if you haven't set one in the config, but that means you're reliant on that token file not getting mangled: https://github.com/k3s-io/k3s/blob/master/pkg/cluster/bootstrap.go#L267-L278
The other possibility is that the sqlite db got corrupted
you can try using the sqlite CLI tools to open the db file and delete any rows whose keys start with `/bootstrap/`
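A rough sketch of what that could look like with the `sqlite3` CLI; the db path and the `kine` table/`name` column are assumptions based on kine's default sqlite layout, so verify them against your install, and stop rke2 and back up the file before touching anything:
```
# Assumed kine sqlite layout: table `kine`, key column `name`.
# Path adjusted for this install's /data prefix (default is /var/lib).
cp /data/rancher/rke2/server/db/state.db /tmp/state.db.bak

# Inspect the bootstrap rows first
sqlite3 /data/rancher/rke2/server/db/state.db \
  "SELECT name FROM kine WHERE name LIKE '/bootstrap/%';"

# If a stale row turns out to be the culprit, it could then be removed:
# sqlite3 /data/rancher/rke2/server/db/state.db \
#   "DELETE FROM kine WHERE name LIKE '/bootstrap/%';"
```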
r
I'm wondering if a compaction + nearly full disk messed things up?
c
but regardless, I would recommend setting a fixed token in your config, even on a single node cluster.
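Something along these lines, assuming the default `/etc/rancher/rke2/config.yaml` config path (the token value below is just a placeholder):
```
# Hypothetical example: pin the cluster token in the rke2 config file
# so restarts don't depend on the generated token file surviving intact.
cat >> /etc/rancher/rke2/config.yaml <<'EOF'
token: my-fixed-cluster-token
EOF
```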
there are all kinds of fun things that can happen when you intentionally fill your disks. I can only guess at what actually occurred.
r
I'll take a look in the db w/r/t `/bootstrap`, and yea we can look at hardcoding a token for sure.
> there are all kinds of fun things that can happen when you intentionally fill your disks. I can only guess at what actually occurred.
indeed!
Thanks for the help!
c
gl!