This message was deleted Rancher Users #longhorn-storage

Join Slack

This message was deleted.

# longhorn-storage

adamant-kite-43734

10/22/2024, 3:32 PM

This message was deleted.

powerful-librarian-10572

10/22/2024, 3:46 PM

Try the upgrade again?

nice-businessperson-14225

10/22/2024, 3:46 PM

Well the upgrade never failed in the first place

nice-businessperson-14225

10/22/2024, 3:47 PM

the cluster seems to think 1.7.1 is installed properly

powerful-librarian-10572

10/22/2024, 3:47 PM

Yeah but you can still upgrade an app in place

nice-businessperson-14225

10/22/2024, 3:47 PM

allright, will try

powerful-librarian-10572

10/22/2024, 3:47 PM

Just do reinstall/upgrade -> next -> next

powerful-librarian-10572

10/22/2024, 3:47 PM

It will be basically a redeploy

nice-businessperson-14225

10/22/2024, 3:47 PM

Im worried about hitting the same problem though

powerful-librarian-10572

10/22/2024, 3:48 PM

It cant be worse anyway no?

nice-businessperson-14225

10/22/2024, 3:53 PM

It worked, but it didnt seem to change anything (i.e. I still see longhorn components versioned at 1.6.2)

nice-businessperson-14225

10/22/2024, 3:53 PM

Im also hitting this now:

Copy code

MountVolume.MountDevice failed for volume "pvc-7382a499-3d90-477d-a095-5e476e95b0cd" : rpc error: code = Internal desc = mount failed: exit status 32 Mounting command: mount Mounting arguments: -t ext4 -o defaults /dev/longhorn/pvc-7382a499-3d90-477d-a095-5e476e95b0cd /var/lib/kubelet/plugins/kubernetes.io/csi/driver.longhorn.io/7679414816d052a2e817f12e9023f7cdc0eb6094ff830b369ee09fb6f8a72513/globalmount Output: mount: /var/lib/kubelet/plugins/kubernetes.io/csi/driver.longhorn.io/7679414816d052a2e817f12e9023f7cdc0eb6094ff830b369ee09fb6f8a72513/globalmount: /dev/longhorn/pvc-7382a499-3d90-477d-a095-5e476e95b0cd already mounted or mount point busy. dmesg(1) may have more information after failed mount system call.

powerful-librarian-10572

10/22/2024, 3:54 PM

huh

nice-businessperson-14225

10/22/2024, 3:59 PM

Any idea @powerful-librarian-10572? I'm pretty stumped myself at this point

powerful-librarian-10572

10/22/2024, 4:03 PM

this might be dangerous, by you can try (emphasis on try, as it may explode everything, altough it didnt cause issue on my test cluster) to redeploy the longhorn-manager daemonset

nice-businessperson-14225

10/22/2024, 4:03 PM

Ultimately I think the issue is with lingering 1.6.2 components alongside 1.7.1 components, would you agree?

nice-businessperson-14225

10/22/2024, 4:04 PM

Perhaps 1.7.1 can't fully take over since 1.6.2 is still being used somewhere by the system

powerful-librarian-10572

10/22/2024, 4:04 PM

Probably, altough having images engine of 1.6.2 will not cause any issues

powerful-librarian-10572

10/22/2024, 4:04 PM

What components are stuck on 1.6.2 ?

nice-businessperson-14225

10/22/2024, 4:07 PM

rancher/mirrored-longhornio-longhorn-manager:v1.6.2 rancher/mirrored-longhornio-longhorn-manager:v1.6.2 (twice on the same node apparently?) rancher/mirrored-longhornio-longhorn-engine:v1.6.2 rancher/mirrored-longhornio-longhorn-instance-manager:v1.6.2

nice-businessperson-14225

10/22/2024, 4:07 PM

~~I imagine its the instance manager causing most problems~~

powerful-librarian-10572

10/22/2024, 4:08 PM

yes and the instance manager is a pod so no clue

nice-businessperson-14225

10/22/2024, 4:09 PM

wdym?

powerful-librarian-10572

10/22/2024, 4:10 PM

I don't know how to fix those

nice-businessperson-14225

10/22/2024, 4:11 PM

Ah, ok, thanks for the help anyway

nice-businessperson-14225

10/22/2024, 4:11 PM

I wish I could just go back in time

powerful-librarian-10572

10/22/2024, 4:11 PM

try to fiddle around with the longhorn.io.instancemanager objects

powerful-librarian-10572

10/22/2024, 4:12 PM

You can also try to restore from etcd snapshot but im not sure that it would work

nice-businessperson-14225

10/22/2024, 4:12 PM

yeah Im worried about restoring and borking things even more

86 Views

Open in Slack

Previous Next