adamant-kite-43734
06/26/2025, 1:30 AMworried-mechanic-84133
06/26/2025, 6:28 PMmaxSurge: 0
means the current node must be deleted before a new one can be provisioned. If spec.agentConfig.additionalUserData.data
is even slightly different in formatting between the rke2controlplane
and the rendered rke2config
(e.g., due to whitespace or quote style), it triggers a change in the machine spec. Since the cluster sees it as an outdated spec and no extra node can be created (because maxSurge: 0
), it deletes the only node — causing the cluster to go down.
Recommendation:
Temporarily set maxSurge: 1
and maxUnavailable: 0
to allow the new node to come up before the old one is removed. That will prevent this destructive behavior and let you validate if the issue is caused by spec drift.millions-france-61475
06/27/2025, 12:52 AM