# longhorn-storage
l
Is `kubectl describe …` on the workloads consuming some of these PVCs tattling on anything?
Is `kubectl describe` on the PVCs themselves saying anything?
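(For reference, those checks would look roughly like this; the workload, PVC, and namespace names are placeholders.)

```sh
# Placeholder names -- substitute your own workload, PVC, and namespace
kubectl describe pod my-workload-0 -n my-namespace
kubectl describe pvc my-data-pvc -n my-namespace
```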
And how many vols do you have? Thinking if this could be a case of … wait some more until the `instance-manager` and `engine-manager` are done processing.
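(A quick way to see whether those managers are still coming up after the upgrade; the component label is an assumption and may differ between Longhorn versions.)

```sh
# All longhorn-system pods should settle into Running/Ready after an upgrade
kubectl get pods -n longhorn-system
# Narrow down to the instance managers (label is an assumption; check your version)
kubectl get pods -n longhorn-system -l longhorn.io/component=instance-manager
```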
s
The `kubectl describe ...` on the workloads is saying that it's waiting for the volumes to attach and then `timed out waiting for the condition`. The `describe` on the PVs as well as on the PVCs isn't showing anything under `Events`, but I've been at this for a few hours, so maybe whatever event showed up there is gone now. I have 15 volumes.
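(Since events age out of the API server after a while, one option is to pull whatever is still recorded, sorted by time; the workload namespace below is a placeholder.)

```sh
# Events expire (roughly 1h by default), so older ones may already be gone
kubectl get events -n my-namespace --sort-by=.lastTimestamp
kubectl get events -n longhorn-system --sort-by=.lastTimestamp
```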
l
15 is kinda nothing … so shouldn’t be that. And nothing obvious in the stdout / container logs of whatever workloads?
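(The usual way to check, with placeholder names; `--previous` covers a container that already crashed and restarted.)

```sh
# Placeholder pod and namespace names
kubectl logs my-workload-0 -n my-namespace
kubectl logs my-workload-0 -n my-namespace --previous
```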
s
I'm combing through them but I don't think I'm going to find much because these workloads can't even start without the volumes attached
l
ah yeah okay … fair enough. Makes sense.
And what about if you port-forward to the Longhorn UI … are the nodes Healthy and schedulable?
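(For a stock install, something like this reaches the UI; the service name and namespace are the defaults and may differ in a customized deployment.)

```sh
# Default frontend service for a stock Longhorn install
kubectl port-forward -n longhorn-system svc/longhorn-frontend 8080:80
# then browse http://localhost:8080 and check the Node page for Ready / Schedulable
```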
s
The Longhorn nodes are healthy and schedulable, but one thing I'm noticing is that all the replicas on all the nodes are either `Failed`, `Stopped`, or `Unknown`...
There's also nothing in the `instance-manager` pods' logs, neither the `-e` nor the `-r` ones, which is suspicious...
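(The same replica and engine state is also visible through Longhorn's CRs; the pod names below are placeholders taken from `kubectl get pods`, and the printer columns vary by Longhorn version.)

```sh
# Replica and engine state as Longhorn's custom resources
kubectl get replicas.longhorn.io -n longhorn-system
kubectl get engines.longhorn.io -n longhorn-system
# Raw logs from both instance-manager flavours (placeholder pod names)
kubectl logs -n longhorn-system instance-manager-e-xxxxxxxx
kubectl logs -n longhorn-system instance-manager-r-xxxxxxxx
```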
Also, if I open a given volume, it shows me that the only available replica is stopped but assigned to an instance manager with a name that doesn't match any of the `instance-manager-r-*` pods I see running.
Is it possible that during the upgrade the connection between replicas and instance-managers got lost or out of sync, and now I have to fix it manually on the custom resource objects directly?
Let me start a new thread with this because I think this looks like a potential root cause...
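(One way to compare what a replica CR records against what actually exists; the exact field names, such as `instanceManagerName`, vary slightly between Longhorn versions.)

```sh
# What instance manager do the replica CRs think they belong to?
kubectl get replicas.longhorn.io -n longhorn-system -o yaml | grep -i instancemanager
# ...compared with the instance-manager CRs and pods that actually exist
kubectl get instancemanagers.longhorn.io -n longhorn-system
kubectl get pods -n longhorn-system | grep instance-manager
```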
l
yeah new thread