# harvester
g
Hello folks, I followed https://github.com/harvester/harvester/issues/6975 to create a storageClass using SAN LUNs on Harvester v1.4.2. I enabled multipath, created a customization file in /oem to add entries in /etc/fstab at boot time, and the nodes have the SAN disks mounted after reboot. I then used the Longhorn UI to add the disks to the nodes, assigned node and disk tags, and created a storageClass with nodeSelector and diskSelector. All seems fine, but when I create a PVC from that storageClass I get "No available disk candidates to create a new replica of size ..." and "Cannot find a reusable failed replicas for volume pvc-xxx" errors in the longhorn-manager pods. Can anyone point me to where I can get some more logs/troubleshooting info? Thanks in advance
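(For anyone finding this later: the /oem customization being described is an Elemental/yip-style cloud-config that gets re-applied at boot. A rough sketch is below; the file name, stage, device name, and mount path are placeholders, not the exact workaround from issue #6975.)

```yaml
# /oem/99-san-luns.yaml : rough sketch only. File name, stage, device name,
# and mount path are placeholders, not the exact workaround from issue #6975.
stages:
  initramfs:
    - name: "Mount SAN multipath LUNs for Longhorn"
      commands:
        # create a mount point for the multipath device and add it to fstab
        # so the disk is mounted on every boot
        - mkdir -p /var/lib/harvester/san-disk1
        - grep -q san-disk1 /etc/fstab || echo "/dev/mapper/mpatha /var/lib/harvester/san-disk1 xfs defaults 0 0" >> /etc/fstab
```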
b
You have good output from
multipath -ll
?
g
yes, I have 4 paths per disk as expected, I did a dd test and write speed is also good
b
I'm a little confused, as it looks like the Harvester issue you posted is for adding the SAN LUNs to the system for use with Longhorn, not for a separate StorageClass.
Can you dump your storage class YAML that's failing?
g
I used a new storageClass because I only map the LUNs to two out of three nodes (and I configured 2 replicas for this SC); yes, I can paste the SC yaml, give me a sec; I can also take a support bundle
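(For context, a Longhorn StorageClass with both selectors looks roughly like the sketch below; the name and tag values are placeholders, not the actual YAML from this setup.)

```yaml
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: san-lun             # placeholder name
provisioner: driver.longhorn.io
allowVolumeExpansion: true
reclaimPolicy: Delete
volumeBindingMode: Immediate
parameters:
  numberOfReplicas: "2"
  staleReplicaTimeout: "30"
  diskSelector: "san"       # placeholder disk tag
  nodeSelector: "sannode"   # placeholder node tag
```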
b
The bundle might be good for some SUSE folks, but I won't mess with it. (just an FYI)
So here's the thing: if you're adding the path that the LUNs are mapped to, as per that last comment in the GitHub issue that's listed as a workaround, the SAN LUNs are abstracted from the k8s cluster. Longhorn (the storage provider) just sees it as another "disk" on the system. It doesn't care how it's connected; it's just raw space. Now... you can add labels to that storage "device" in Longhorn to set expectations because of lag/speed (maybe your SAN is too slow for etcd or something), just like you could for spinning rust vs SSD vs NVMe, but by default Longhorn is just gonna plop things wherever. It doesn't even matter that only two of the nodes have the SAN attached. It'll make sure there are three copies of the volume somewhere.
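(To make that concrete: those tags live on the Longhorn Node object, so a tagged SAN-backed disk shows up roughly like the sketch below; the node name, disk name, path, and tag values are placeholders. The status/conditions on the same object are also a useful place to check why replica scheduling is failing.)

```yaml
apiVersion: longhorn.io/v1beta2
kind: Node
metadata:
  name: node-1              # placeholder node name
  namespace: longhorn-system
spec:
  tags:
    - sannode               # node tag, matched by a StorageClass nodeSelector
  disks:
    san-disk1:              # placeholder disk name
      path: /var/lib/harvester/san-disk1
      allowScheduling: true
      storageReserved: 0
      tags:
        - san               # disk tag, matched by a StorageClass diskSelector
```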
g
multipath_out.txt
b
That actually might be a really good thing. If your SAN takes a dump, you'd potentially still have a copy on a hyperconverged local host somewhere and you'd be ok.
t
I would test with 1.5.0 and add a CSI to talk directly to the FC array. Avoid longhorn for this.
b
That's pretty good advice too
Though it means you won't be able to take backups.
t
You can! on the array.
b
Via the array, but not via Harvester
t
agreed.
g
I could probably try that, not sure how I can add the CSI but I'll do some reading
b
diskSelector: vm
means you shouldn't need to have the nodeSelector
🙌 1
if the Longhorn disks have that label on them
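(i.e. in the StorageClass sketch above, the parameters shrink to something like this:)

```yaml
parameters:
  numberOfReplicas: "2"
  diskSelector: "vm"        # matches the disk tag already set on the Longhorn disks
```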
g
I remember there was some IBM block operator for FlashSystems, not sure about these old Storwize arrays I have in my lab
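(If anyone tries the array-side CSI route: the operator mentioned is the IBM block storage CSI driver, and a StorageClass for it looks roughly like the sketch below. The provisioner name and parameters here are recalled from that driver's docs and may differ by version, and support for older Storwize models depends on their firmware level, so treat all of this as an assumption to verify.)

```yaml
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: ibm-fc              # placeholder name
provisioner: block.csi.ibm.com    # assumed driver name; verify against the CSI operator docs
parameters:
  pool: lab_pool            # placeholder pool name on the array
  csi.storage.k8s.io/fstype: xfs
```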
t
take a look at

https://youtu.be/5VB8fHJuAQQ

b
I would change it to something more obvious like
sanlun
๐Ÿ‘ 1
t
you will need 1.5.0-rc4
✅ 1
g
@bland-article-62755 I re-created the storageClass with only the diskSelector tag (without nodeSelector) and the volumes are created successfully now, thanks a lot for the suggestion!
๐Ÿ‘ 1
b
🎉
g
I will probably test with the custom CSI as well on some VMs, just for fun
b
having backups through Harvester is worth dealing with the overhead imho. Besides, you already did all the hardest parts by getting it into the node.
You can always have both.
But backups give you a way to migrate VMs to other clusters.
g
yes, Longhorn-managed backups are nice to have
although at some point (for this lab) re-mapping the LUNs to a different host or snapshotting at the storage level could also work