millions-france-61475
06/26/2025, 1:30 AMspec.agentConfig.additionalUserData.data
. The node gets deleted as soon as it comes up due to Rolling 1 replicas with outdated spec (0 replicas up to date)
. I suspect this is because the formatting of the additional user data changes slightly from the rke2controlplane to the rke2config due to the way it's rendered. I have maxSurge
set to 0. Does it seem like this could be the cause?worried-mechanic-84133
06/26/2025, 6:28 PMmaxSurge: 0
means the current node must be deleted before a new one can be provisioned. If spec.agentConfig.additionalUserData.data
is even slightly different in formatting between the rke2controlplane
and the rendered rke2config
(e.g., due to whitespace or quote style), it triggers a change in the machine spec. Since the cluster sees it as an outdated spec and no extra node can be created (because maxSurge: 0
), it deletes the only node — causing the cluster to go down.
Recommendation:
Temporarily set maxSurge: 1
and maxUnavailable: 0
to allow the new node to come up before the old one is removed. That will prevent this destructive behavior and let you validate if the issue is caused by spec drift.millions-france-61475
06/27/2025, 12:52 AM