Hi All, I have a longhorn instance running on RKE2...
# longhorn-storage
w
Hi All, I have a longhorn instance running on RKE2 and we're trying to upgrade the cluster but have run into an issue with the
longhorn-manager
stuck in
Init:0/1
Cluster: v1.25.10+rke2r1 Longhorn: 1.4.2 Deployment: Helm we initially tried to upgrade Longhorn to 1.5.5 using helm but it failed. After going through a few github issues we managed to get the upgrade to work
but now all the
longhorn-manager
pods are stuck. I've noticed in
longhorn-manager
daemonset this stanza
Copy code
initContainers:
      - command:
        - sh
        - -c
        - while [ $(curl -m 1 -s -o /dev/null -w "%{http_code}" -k <https://longhorn-admission-webhook:9443/v1/healthz>)
          != "200" ]; do echo waiting; sleep 2; done
        image: longhornio/longhorn-manager:v1.4.2
        imagePullPolicy: IfNotPresent
        name: wait-longhorn-admission-webhook
        resources: {}
but the SVC for
longhorn-admission-webhook
ports is
Copy code
ports:
  - name: admission-webhook
    port: 9502
    protocol: TCP
    targetPort: admission-wh
  selector:
    app: longhorn-manager
looking at one of the manager pods
Copy code
UID          PID    PPID  C STIME TTY          TIME CMD
root           1       0  0 17:04 ?        00:00:00 sh -c while [ $(curl -m 1 -s -o /dev/null -w "%{http_code}" -k <https://longhorn-admission-webhook:9443/v1/healthz>) != "200" ]; do echo waiting; sleep 2; done