bitter-garage-45274
01/09/2023, 7:32 AM
bitter-garage-45274
01/09/2023, 7:33 AM
witty-tiger-89786
01/09/2023, 8:15 AM
bitter-garage-45274
01/09/2023, 8:20 AM
bitter-garage-45274
01/09/2023, 8:21 AM
sparse-fireman-14239
01/10/2023, 10:33 AM
great-photographer-94826
01/12/2023, 11:38 AM
miniature-wolf-97427
01/19/2023, 1:23 PM
`longhorn-manager-*` pods aren't coming up and are instead going into `CrashLoopBackOff` status. Running `kubectl logs pod/longhorn-manager-xxxxx -n longhorn-system` gives me:
Defaulted container "longhorn-manager" out of: longhorn-manager, wait-longhorn-admission-webhook (init)
W0119 13:13:36.447109 1 client_config.go:617] Neither --kubeconfig nor --master was specified. Using the inClusterConfig. This might not work.
time="2023-01-19T13:13:36Z" level=info msg="cannot list the content of the src directory /var/lib/rancher/longhorn/engine-binaries for the copy, will do nothing: failed to execute: nsenter [--mount=/host/proc/1/ns/mnt --net=/host/proc/1/ns/net bash -c ls /var/lib/rancher/longhorn/engine-binaries/*], output , stderr ls: cannot access '/var/lib/rancher/longhorn/engine-binaries/*': No such file or directory\n: exit status 2"
I0119 13:13:36.496292 1 leaderelection.go:248] attempting to acquire leader lease longhorn-system/longhorn-manager-upgrade-lock...
I0119 13:13:36.510495 1 leaderelection.go:258] successfully acquired lease longhorn-system/longhorn-manager-upgrade-lock
time="2023-01-19T13:13:36Z" level=info msg="Start upgrading"
time="2023-01-19T13:13:36Z" level=info msg="setting default-engine-image not found"
time="2023-01-19T13:13:46Z" level=error msg="Upgrade failed: upgrade API version failed: cannot create CRDAPIVersionSetting: Internal error occurred: failed calling webhook \"validator.longhorn.io\": failed to call webhook: Post \"https://longhorn-admission-webhook.longhorn-system.svc:9443/v1/webhook/validaton?timeout=10s\": context deadline exceeded"
time="2023-01-19T13:13:46Z" level=info msg="Upgrade leader lost: c3.hq.iwink.nl"
time="2023-01-19T13:13:46Z" level=fatal msg="Error starting manager: upgrade API version failed: cannot create CRDAPIVersionSetting: Internal error occurred: failed calling webhook \"validator.longhorn.io\": failed to call webhook: Post \"https://longhorn-admission-webhook.longhorn-system.svc:9443/v1/webhook/validaton?timeout=10s\": context deadline exceeded"
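The fatal error in the log is a timeout while calling the admission webhook service on port 9443. A first check is whether that port is reachable over TCP at all. Below is a minimal sketch of such a check, assuming it is run from a pod inside the cluster, since the `.svc` name only resolves there. `can_connect` is a hypothetical helper, not part of Longhorn or Kubernetes; the 10-second default mirrors the `timeout=10s` in the logged URL. It tests TCP reachability only, not TLS or the webhook handler.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 10.0) -> bool:
    """Return True if a TCP connection to host:port opens within timeout seconds."""
    try:
        # create_connection resolves the name and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers DNS failures, refused connections, and timeouts alike.
        return False
```

For the service in the log, the call would be `can_connect("longhorn-admission-webhook.longhorn-system.svc", 9443)`. A `False` result points at networking or at the webhook pod not listening, rather than at the request itself.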
acceptable-printer-7134
01/20/2023, 2:50 PM
`csi-resizer` is in a continuous CrashLoopBackOff. Any pointers? Attaching logs for reference.
wonderful-appointment-6480
01/22/2023, 8:30 PM
acceptable-printer-7134
01/24/2023, 11:47 AM
lively-balloon-7264
01/24/2023, 12:41 PM
millions-gold-16761
01/24/2023, 9:46 PM
spec: failed to generate spec: path "/var/lib/longhorn/" is mounted on "/" but it is not a shared mount
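The error above turns on mount propagation: the path lives on the `/` mount, and that mount is not marked shared. Below is a minimal sketch for checking this on a node, assuming the Linux `/proc/self/mountinfo` format, where a `shared:N` tag in the optional fields marks a shared mount. The function name `mount_propagation` is made up for illustration and is not part of Longhorn or Kubernetes.

```python
from typing import Optional


def mount_propagation(mountinfo_text: str, mount_point: str) -> Optional[str]:
    """Return "shared", "slave", or "private" for mount_point, or None if not listed.

    A mount tagged both shared and slave is reported as "shared";
    unbindable mounts are lumped in with "private".
    """
    result = None
    for line in mountinfo_text.splitlines():
        fields = line.split()
        if len(fields) < 7 or "-" not in fields:
            continue  # not a well-formed mountinfo line
        if fields[4] != mount_point:
            continue  # field 5 (index 4) is the mount point
        # Optional fields sit between the mount options (index 5) and the "-" separator.
        optional = fields[6:fields.index("-")]
        if any(f.startswith("shared:") for f in optional):
            result = "shared"
        elif any(f.startswith("master:") for f in optional):
            result = "slave"
        else:
            result = "private"
    # When a mount point is listed more than once, the last entry is kept.
    return result
```

On a node, passing the contents of `/proc/self/mountinfo` together with `"/"` would be expected to return something other than `"shared"` if the error above applies.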
orange-airplane-98016
01/24/2023, 10:35 PM
swift-ambulance-60302
01/25/2023, 5:43 PM
alert-policeman-61846
01/25/2023, 6:22 PM
sparse-fireman-14239
01/27/2023, 8:21 AM
sparse-fireman-14239
01/29/2023, 1:53 PM
stocky-article-82001
01/31/2023, 7:44 PM
stocky-article-82001
01/31/2023, 7:44 PM
sparse-businessperson-74827
01/31/2023, 8:00 PM
happy-gigabyte-12209
01/31/2023, 8:42 PM
happy-gigabyte-12209
01/31/2023, 8:42 PM
happy-gigabyte-12209
01/31/2023, 8:43 PM
happy-gigabyte-12209
01/31/2023, 8:43 PM
happy-gigabyte-12209
01/31/2023, 8:44 PM
late-needle-80860
02/02/2023, 9:49 AM
symlinked /var/lib/kubelet to the dedicated disk, to get all Pod-related data onto that disk.
Now I'm unfortunately seeing the following error for workloads that use PVCs:
MountVolume.SetUp failed for volume "pvc-7dc11d73-3595-47a9-bb02-a95f23518ca5" : applyFSGroup failed for vol pvc-7dc11d73-3595-47a9-bb02-a95f23518ca5: lstat /k3s-worker-data/kubelet/pods/12bfddbb-a8fe-4edb-9620-6e96f40ce840/volumes/kubernetes.io~csi/pvc-7dc11d73-3595-47a9-bb02-a95f23518ca5/mount: no such file or directory
If I create the `mount` directory manually, files are created in the dir … However, when I compare the permissions on that dir with those of another workload, running on a cluster where I'm NOT symlinking to dedicate a disk to container data, the permissions are different.
On the non-working node it's the `fsGroup` on the files in the sub-dirs of `mount`. I'm in doubt about how to set these permissions, if I need to at all.
The stateful workload I'm troubleshooting does set the `fsGroup` in the `SecurityContext` in its `Kind: StatefulSet` manifest.
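The comparison between the working and non-working node can be made repeatable with a small script. Below is a minimal sketch that reports whether a path is a symlink, where it resolves to, and its mode, group ID, and setgid bit. `describe_path` is a hypothetical helper. It also assumes that `fsGroup` normally shows up as group ownership plus the setgid bit on the volume directory, which may not hold for every volume type.

```python
import os
import stat


def describe_path(path: str) -> dict:
    """Collect symlink and permission details for path."""
    info = {
        "path": path,
        "is_symlink": os.path.islink(path),
        "real_path": os.path.realpath(path),  # target after following all symlinks
        "exists": os.path.exists(path),
    }
    if info["exists"]:
        st = os.stat(path)  # follows symlinks, unlike the lstat in the error
        info["mode"] = oct(stat.S_IMODE(st.st_mode))
        info["gid"] = st.st_gid
        info["setgid"] = bool(st.st_mode & stat.S_ISGID)
    return info
```

Running it against `/var/lib/kubelet` and against the `mount` directory from the error, on both nodes, would give two dictionaries per node that can be compared field by field.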
Any ideas? Suggestions are VERY WELCOME
late-needle-80860
02/02/2023, 12:13 PM
`kubectl delete …` directly.
Any ideas? Thank you.
melodic-hamburger-23329
02/03/2023, 2:21 AM
numerous-lighter-90852
02/03/2023, 10:00 AM