While launching `test-vm001`, the VM is stuck in t...
# harvester
m
While launching `test-vm001`, the VM is stuck in the "Starting" state and is not reported as running. The error shown in the events section suggests that the volume attachment to node `orion1` is timing out. Requesting help in debugging this issue; it could be related to Longhorn volume scheduling or node availability.
s
Can you describe your cluster? How many nodes are there, what are the node names, and which nodes have Longhorn disks? Also, could you check what the Longhorn UI shows?
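(A minimal CLI sketch for gathering that, assuming you have a kubeconfig pointed at the Harvester cluster:)
```
# List every node with its roles, version, and IP addresses
kubectl get nodes -o wide

# Quick node count
kubectl get nodes --no-headers | wc -l
```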
m
Here’s my cluster setup:
• Total nodes: 3
• Node names: `orion`, `orion0`, and `orion1`
• Longhorn disks: all three nodes have storage allocated for Longhorn, as seen in the screenshot.
Also, I’d like to know how to access the Longhorn UI and view/manage the Longhorn volumes. Could you please guide me on that?
Please take a look @sparse-vr-27407.
I want to know what the exact issue is.
t
You can access the Longhorn UI by enabling the Extension Developer feature;
the link is then available on the support page, which you reach by clicking "Support" in the lower left.
m
Thanks @thousands-advantage-10804
s
This "Pereferences" setting is for your user profile, click on the top right user icon and there will it be. It took me a while to find it and after i had to reload the UI (F5) to make it work. In the left menu on the bottom there is a blue text "Support" and after clicking, on the right side there will be 2 extra menu entries, one of them leading you to the Longhorn UI. A screenshot of the dashboard would be nice.
m
@sparse-vr-27407 I'm encountering problems creating VMs when all nodes in the 3-node Harvester cluster are active.
• When only the master node is online, VMs are created and start successfully.
• But when the full cluster is active, VM creation fails with volume attach errors.
• The VM pod gets scheduled, but the `virt-launcher` pod fails.
s
You say "the" master -> does that mean you have only one, despite 3 nodes, and no HA on your control plane? Please post a `kubectl get nodes` against your cluster. Still, a look at the Longhorn dashboard and the Longhorn node list would be my first step if this were my problem. I'd also check the volume that is in the error state under Longhorn UI / Volumes by clicking on it. From the events it looks to me like a PVC/PV binding issue, and I'd debug it as such, not focusing on the purpose of the volume. Alternatively, I'd change the VM's config to prefer scheduling to the master and see if that works out.
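(A rough sketch of that PVC/PV check from the CLI; the PVC name below is hypothetical and the namespaces assume a stock Harvester install, so adjust to your setup:)
```
# Find the PVCs backing the VM and confirm whether they are Bound
kubectl get pvc -A

# Inspect events on a specific PVC (hypothetical name shown)
kubectl -n default describe pvc test-vm001-disk-0

# Longhorn's view of the volumes: state, robustness, owning node
kubectl -n longhorn-system get volumes.longhorn.io
```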
m
Yeah, we can set the option to prefer scheduling to the master node, but that will not sort out our issue.
Attached is the volume section from the Longhorn UI.
s
Sure, but as you said everything is fine when only the master is running, so I would try running all 3 nodes but scheduling or migrating the VM to the master. If that works (3 nodes running, VM fine on the master, failing on the others), then it is a strong clue.
Your Longhorn dashboard shows that some volumes are degraded. A screenshot of the Nodes view in the same Longhorn UI would be interesting.
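(For the degraded volumes, a hedged CLI equivalent, again assuming the stock `longhorn-system` namespace, is to list volume robustness and the replicas behind each volume:)
```
# ROBUSTNESS should read "healthy"; "degraded" means replicas are missing
kubectl -n longhorn-system get volumes.longhorn.io

# See where each replica is (or is not) scheduled and whether it is running
kubectl -n longhorn-system get replicas.longhorn.io -o wide
```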
m
I'm facing a problem where the VM fails before even reaching the scheduling stage due to PVC-related issues.
• PVC attachment fails, leading to `AttachVolume.Attach failed` errors.
• This happens only when all nodes in the 3-node cluster are online.
• When I keep only the master node active, VM creation works fine and PVCs are attached successfully.
AttachVolume.Attach failed for volume "<pvc-id>" : rpc error: code = Aborted desc = volume <pvc-id> is not ready for workloads
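(One possible sketch for digging into that attach error from the CLI; `<pvc-id>` is the placeholder from the error above, and the Longhorn namespace is assumed to be `longhorn-system`:)
```
# Kubernetes-side view: which node the CSI layer is trying to attach the volume to
kubectl get volumeattachments | grep <pvc-id>

# Longhorn-side view: full state of the volume that is "not ready for workloads"
kubectl -n longhorn-system get volumes.longhorn.io <pvc-id> -o yaml

# Recent cluster events, including those on the failing virt-launcher pod
kubectl get events -A --sort-by=.lastTimestamp | tail -n 30
```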
s
'This happens only when all nodes in the 3-node cluster are online.' -> understood; my next step would be to check whether it is specific to a node or not. Configuring the VM to consider only the master node can be achieved by changing 'Node Scheduling' under the VM's settings. Do you want to try this test?
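(If you prefer the CLI, here is a hedged sketch of the same idea, pinning the VM to the master via a nodeSelector on the underlying KubeVirt VirtualMachine; the `default` namespace is an assumption and `orion` is the master name from this thread:)
```
# Stop the VM first, then pin its launcher pods to node "orion" (assumed master)
kubectl -n default patch vm test-vm001 --type merge -p \
  '{"spec":{"template":{"spec":{"nodeSelector":{"kubernetes.io/hostname":"orion"}}}}}'

# Verify the change landed
kubectl -n default get vm test-vm001 -o jsonpath='{.spec.template.spec.nodeSelector}'
```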
m
Yeah, can you provide the steps for doing that?
s
Actually, it is just a click under 'Node Scheduling', which can be found in the VM's settings. Can you post a screenshot of the "Nodes" tab in the same Longhorn UI?
m
Yes, I understand. That option under 'Node Scheduling' becomes configurable after the VM is created. What I'm trying to highlight is this: if we could enable this setting during VM creation, it would benefit HCI setups using Harvester by allowing better control over node selection for VM scheduling. Right now, I'm observing that once all three nodes in the cluster are up and joined, the master node avoids scheduling VMs on itself, and the scheduler tries to place the VM on the other worker nodes. It is during this placement, particularly at the volume attachment stage, that the issue arises, preventing the VM from being created.
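(One hedged way to confirm the "master avoids scheduling VMs on itself" observation is to check that node's taints and labels; `orion` is the node name from this thread, and interpretation of the output is left to the cluster admin:)
```
# Any control-plane taints listed here would explain why the scheduler avoids this node
kubectl describe node orion | grep -i -A 5 taints

# Labels are what a nodeSelector would match against
kubectl get node orion --show-labels
```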
s
To keep debugging that volume issue, can you post a screenshot of the "Nodes" tab in the same Longhorn UI?
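(If a screenshot is a hassle, roughly the same information can be read from the Longhorn node CRs, assuming the `longhorn-system` namespace:)
```
# Per-node readiness and schedulability as Longhorn sees it
kubectl -n longhorn-system get nodes.longhorn.io

# Disk details (paths, allowScheduling, reserved/available storage) for one node
kubectl -n longhorn-system get nodes.longhorn.io orion1 -o yaml
```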