# longhorn-storage
i
Can you check the information in the longhorn-manager log?
b
https://github.com/longhorn/longhorn/issues/5558 can you check on this, with the latest comments
i
Are you using RWX volumes?
b
yes, Derek
will the workaround also work for RWX volumes?
i
Yes, check my update.
b
will do and keep you posted
unable to delete node: could not delete node ip-172-31-9-232.ec2.internal with node ready condition is False, reason is KubernetesNodeNotReady, node schedulable false, and 0 replica, 0 engine running on it
I have deleted the unknown pod, and when I tried to delete the node via the Longhorn UI, the above message appeared. My cluster is on k3s.
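For context, the state that error refers to can also be inspected from kubectl; a rough sketch, using the node name from the message above (Longhorn keeps its own node object alongside the Kubernetes one):

```
# Kubernetes view of the node: Ready condition and schedulability
kubectl get node ip-172-31-9-232.ec2.internal

# Longhorn's own node custom resource, which tracks the replicas/engines scheduled on it
kubectl -n longhorn-system get nodes.longhorn.io ip-172-31-9-232.ec2.internal -o yaml
```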
i
No, I mean try deleting the instancemanager resource
Check the second workaround. You don’t need to delete the down node.
b
ok fine
will do
👍 1
it happened again
even though the replica is up, it is not being picked up
i
Can you provide the support bundle?
b
sure
no change in the longhorn.yaml file
i
Sorry, I mean generate a support bundle via UI (bottom left side)
b
cool doing that..
i
Sorry, I don’t see the support bundle in the ticket. You can generate a support bundle tar file from the UI page.
b
just curious to know, does the support zip file contain any sensitive information?
@icy-agency-38675 please confirm on this; I have the zip file downloaded locally
i
b
this - just curious to know, does the support zip file contain any sensitive information?
i
Well, in general, no. But you can manually redact any information that is sensitive to you.
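If you want to see what is inside before sending it, one simple way is to unpack it and scan for patterns you care about; a minimal sketch, assuming the bundle is a regular zip (the filename below is just a placeholder for whatever the UI downloaded):

```
# Unpack the downloaded bundle and scan it for anything you consider sensitive,
# e.g. internal IP addresses or hostnames, before mailing it out.
unzip supportbundle.zip -d bundle
grep -rnE '([0-9]{1,3}\.){3}[0-9]{1,3}' bundle | less
```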
b
okay sure, sending you the zip
sent the mail, please do the needful
i
Thank you. Will update if I have any findings.
b
pls do the needful
i
Sorry, which volume?
pvc-869f916a-5edc-4547-a62b-679d9cf1dc5b is running and attached
b
yes, I have started the node again
you can check the logs from before it was attached
i
OK
Do you know the boot time of the down node?
b
I am using Red Hat 8.6 on AWS
not aware of the boot time
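For reference, once the node is reachable again the boot time can be read directly on the host, e.g.:

```
# Either command prints the last system boot time on a Linux (incl. RHEL 8) node
who -b
uptime -s
```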
i
Got it. I saw the down node became ready again at
2023-03-23T05:13:54Z
The engine is running at
2023-03-23T05:13:51.887241937Z
The volume only started running right before you brought the down node back up…
If you encounter the issue again, please execute the workaround and generate a support bundle (SB) before restarting the down node.
b
okay, trying that
the PVC is stuck
creating an SB right now
i
Did you delete the unknown instancemanager resources?
b
yes
i
OK
b
check inbox
i
Received
instance-manager-e-b9b279de634f2725487a4ceee253c408 is still there
Can you show
kubectl -n longhorn-system get instancemanagers
b
yes, I deleted it but it keeps showing up
i
🤔
b
xyz ~ % k delete pod instance-manager-e-b9b279de634f2725487a4ceee253c408 instance-manager-r-b9b279de634f2725487a4ceee253c408 --force -n longhorn-system
warning: Immediate deletion does not wait for confirmation that the running resource has been terminated. The resource may continue to run on the cluster indefinitely.
pod "instance-manager-e-b9b279de634f2725487a4ceee253c408" force deleted
pod "instance-manager-r-b9b279de634f2725487a4ceee253c408" force deleted
xyz ~ % kubectl -n longhorn-system get instancemanagers
NAME                                                  STATE     TYPE      NODE                            AGE
instance-manager-e-b9b279de634f2725487a4ceee253c408   unknown   engine    ip-172-31-8-245.ec2.internal    15h
instance-manager-r-b9b279de634f2725487a4ceee253c408   unknown   replica   ip-172-31-8-245.ec2.internal    15h
see, I have deleted them but kubectl still shows them
I even used --force to delete them
so does deleting them need some server-side signal, and that is why they cannot be deleted?
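One thing worth checking here is which object is actually still present, since the pod and the Longhorn instancemanager custom resource are different objects with the same-looking name; a quick sketch using the names from the output above:

```
# The pod, which was force deleted above
kubectl -n longhorn-system get pod instance-manager-e-b9b279de634f2725487a4ceee253c408

# The instancemanager custom resource, which is what the listing above still shows
kubectl -n longhorn-system get instancemanager instance-manager-e-b9b279de634f2725487a4ceee253c408
```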
i
No, let me check
b
once the node is up, the unknown ones are gone
i
I see. I need to take some time to look into this. Another user and I tried the workaround and it worked. Not sure what is happening now. You can start the node. Will keep you posted.
Thank you. Can you provide your environment information? Platform, and the Kubernetes distro and version.
Interesting… I can delete it.
root@rancher60-master:~/longhorn# kl get instancemanager
NAME                                                  STATE     TYPE      NODE                AGE
instance-manager-r-89ccb5ff5acd0803dc91e2b6355464d5   running   replica   rancher60-worker3   160m
instance-manager-e-89ccb5ff5acd0803dc91e2b6355464d5   running   engine    rancher60-worker3   160m
instance-manager-e-f4731def1cefda4f51631c82ea14ad12   running   engine    rancher60-worker1   160m
instance-manager-r-f4731def1cefda4f51631c82ea14ad12   running   replica   rancher60-worker1   160m
instance-manager-r-21850b3359743fcfc290fc3abf9cae85   unknown   replica   rancher60-worker2   160m
instance-manager-e-21850b3359743fcfc290fc3abf9cae85   unknown   engine    rancher60-worker2   160m
root@rancher60-master:~/longhorn# kl delete instancemanager instance-manager-e-21850b3359743fcfc290fc3abf9cae85
instancemanager.longhorn.io "instance-manager-e-21850b3359743fcfc290fc3abf9cae85" deleted
root@rancher60-master:~/longhorn# kl get instancemanager
NAME                                                  STATE     TYPE      NODE                AGE
instance-manager-r-89ccb5ff5acd0803dc91e2b6355464d5   running   replica   rancher60-worker3   160m
instance-manager-e-89ccb5ff5acd0803dc91e2b6355464d5   running   engine    rancher60-worker3   160m
instance-manager-e-f4731def1cefda4f51631c82ea14ad12   running   engine    rancher60-worker1   160m
instance-manager-r-f4731def1cefda4f51631c82ea14ad12   running   replica   rancher60-worker1   160m
instance-manager-r-21850b3359743fcfc290fc3abf9cae85   unknown   replica   rancher60-worker2   160m
b
k3s: v1.25.4+k3s1; Worker and Master: Red Hat 8.5; Longhorn: v1.4.1
i
Gosh…
I know why
Not the pod… you should delete the instancemanager resource: `kubectl -n longhorn-system get instancemanagers` to list the unknown resources, then `kubectl -n longhorn-system delete instancemanager <name>` to delete them.
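Put together, roughly (the names are the ones from this thread; yours will differ):

```
# List instance manager custom resources and note the ones in "unknown" state
kubectl -n longhorn-system get instancemanagers

# Delete the unknown instancemanager custom resources (not the pods)
kubectl -n longhorn-system delete instancemanager instance-manager-e-b9b279de634f2725487a4ceee253c408
kubectl -n longhorn-system delete instancemanager instance-manager-r-b9b279de634f2725487a4ceee253c408
```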
b
ufffff
cool finding buddy
will confirm with you
🙌 1
the workaround works, man
so this fix will be available from v1.4.2?
i
Thank you for the update. Yeah, already merged and will be in v1.4.2.
b
great, updating the ticket too; you can also update it
👍 1