# harvester
p
Does removing the changes in the KubeVirt object work?
Copy code
/spec/customizeComponents/patches/
BTW, can you share the reason you want to evaluate a different image? Thanks.
👍 1
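(For anyone following along: a minimal sketch of inspecting and removing those patches with kubectl; the KubeVirt CR name "kubevirt" is an assumption based on a default Harvester install, so confirm it first.)
# Assumed CR name; confirm with: kubectl get kubevirt -A
kubectl get kubevirt kubevirt -n harvester-system -o yaml | yq -e .spec.customizeComponents
# Remove one added entry by index (check which index yours landed at first):
kubectl patch kubevirt kubevirt -n harvester-system --type=json \
  -p='[{"op":"remove","path":"/spec/customizeComponents/patches/0"}]'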
s
I will take a look at it. Could you provide some examples of what you want to apply? (like what image you would like to patch)
@powerful-soccer-11224 ^^^
p
@prehistoric-balloon-31801 @salmon-city-57654, my Harvester cluster is based on version 1.1.1, and the corresponding KubeVirt version is v0.54.0. I have therefore cloned the KubeVirt code (v0.54.0) and made some changes to the virt-launcher code (this is the only component I am interested in). The image has been pushed to our local internal repository, and to update the virt-launcher image across both virt-controller and virt-handler I updated the following in the KubeVirt CR. Please also let me know if there is any other place I need to update the virt-launcher image apart from these two.
Copy code
customizeComponents:                     
    patches:                               
    - patch: '{"spec":{"template":{"spec":{"containers":[{"name":"virt-controller", "args":["--launcher-image","test-registry.example.com/myrepo/virt-launcher@sha256:9144b0697edd1e26a23282007a017af7564b9813eb32fd5356afaff244c470c5","--port","8443", "-v", "2"]}]}}}}'
      resourceName: virt-controller
      resourceType: Deployment
      type: strategic                             
    - patch: '{"spec":{"template":{"spec":{"initContainers":[{"name":"virt-launcher","image":"test-registry.example.com/myrepo/virt-launcher@sha256:9144b0697edd1e26a23282007a017af7564b9813eb32fd5356afaff244c470c5","imagePullPolicy":"Always"}]}}}}'
      resourceName: virt-handler
      resourceType: Daemonset
      type: strategic
I tried removing
/spec/customizeComponents/patches/
and it did kind of work, but I wanted to be sure whether there is any other process to be followed here.
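(A rough sketch of the apply-and-verify flow for a customizeComponents patch like the one above; the CR name "kubevirt" is an assumption, while the harvester-system namespace appears later in this thread.)
# Edit the KubeVirt CR and add/remove entries under .spec.customizeComponents.patches:
kubectl edit kubevirt kubevirt -n harvester-system
# virt-operator reconciles the workloads afterwards; watch the rollout:
kubectl -n harvester-system rollout status deployment/virt-controller
kubectl -n harvester-system rollout status daemonset/virt-handler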
t
@salmon-city-57654 This isn't my thread, but to give you a heads up, one reason to want a newer/updated KubeVirt is that SR-IOV is unusable in a lot of situations in the current version of Harvester, due to a bug that existed in 0.54 and is fixed in the 0.54 release branch, but not yet pulled back into the 0.54.0-150400.3.19.1 version. Basically, the SR-IOV devices are not presented to the VM in the order they are assigned, and they may not even be presented to the correct VMs because of the mapping bug. I think https://github.com/kubevirt/kubevirt/pull/8415 is a link to one of the related fixes that hasn't been pulled into the builds yet.
👍 1
s
Hi @powerful-soccer-11224, yes, if you want to patch the virt-launcher image, these two parts should be patched. Did you try it after you patched the virt-launcher image? Also, if you want to roll back, just remove the parts you added from /spec/customizeComponents/patches/ on the KubeVirt CR. Does it work now?
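(A quick, sketched way to confirm a rollback actually reached the workloads; resource names match the defaults shown later in this thread.)
# The --launcher-image arg should be back to the stock registry.suse.com image:
kubectl -n harvester-system get deployment virt-controller -o jsonpath='{.spec.template.spec.containers[0].args}{"\n"}'
# And the virt-handler init container should no longer reference the custom virt-launcher image:
kubectl -n harvester-system get daemonset virt-handler -o jsonpath='{.spec.template.spec.initContainers[*].image}{"\n"}'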
p
I removed them, but still no luck while rolling back to the previous image registry.suse.com/suse/sles/15.4/virt-launcher:0.54.0-150400.3.3.2
s
Thanks, @thousands-action-31843; that’s a good example of why we need to patch the KubeVirt image.
@powerful-soccer-11224, could you try patching it the same way as above, but replacing your image with the previous one?
p
I tried that too, by updating the virt-controller entry in /spec/customizeComponents/patches/ to use the image registry.suse.com/suse/sles/15.4/virt-launcher:0.54.0-150400.3.7.1, but the controller pods are in a CrashLoopBackOff state, and on inspecting the container logs I found this:
exec /usr/bin/virt-controller: exec format error
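(For context: "exec format error" usually points at an architecture mismatch or a corrupted binary/image layer, so one sketched sanity check is comparing node architectures.)
# A mixed-arch cluster or a bad cached layer would both show up as "exec format error" on specific nodes only:
kubectl get nodes -o custom-columns=NAME:.metadata.name,ARCH:.status.nodeInfo.architecture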
s
Could you check the virt-controller deployment? What virt-controller image are you using?
p
From the virt-controller deployment
Copy code
Events:
  Type     Reason          Age                   From               Message
  ----     ------          ----                  ----               -------
  Normal   Scheduled       38m                   default-scheduler  Successfully assigned harvester-system/virt-controller-9c6f7799-lwhsj to iaas-node-002
  Normal   AddedInterface  38m                   multus             Add eth0 [10.52.2.151/32] from k8s-pod-network
  Normal   Pulled          37m (x4 over 38m)     kubelet            Container image "registry.suse.com/suse/sles/15.4/virt-controller:0.54.0-150400.3.7.1" already present on machine
  Normal   Created         37m (x4 over 38m)     kubelet            Created container virt-controller
  Normal   Started         37m (x4 over 38m)     kubelet            Started container virt-controller
  Warning  BackOff         3m7s (x179 over 38m)  kubelet            Back-off restarting failed container
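(For completeness, the error quoted earlier comes from the container log; it can be pulled from the crashed container with --previous, using the pod name from the events above.)
kubectl -n harvester-system logs virt-controller-9c6f7799-lwhsj --previous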
s
Could you check the following command output?
Copy code
kubectl get deployment virt-controller -n harvester-system -o yaml |yq -e .spec.template.spec.containers
And if you use the image you patched, does that work well?
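(A companion check for the other patched resource, sketched under the same assumptions as the command above.)
kubectl get daemonset virt-handler -n harvester-system -o yaml | yq -e .spec.template.spec.initContainers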
p
here's the output of the command:
Copy code
- args:
    - --launcher-image
    - registry.suse.com/suse/sles/15.4/virt-launcher:0.54.0-150400.3.7.1
    - --port
    - "8443"
    - -v
    - "2"
  command:
    - virt-controller
  image: registry.suse.com/suse/sles/15.4/virt-controller:0.54.0-150400.3.7.1
  imagePullPolicy: IfNotPresent
  livenessProbe:
    failureThreshold: 8
    httpGet:
      path: /healthz
      port: 8443
      scheme: HTTPS
    initialDelaySeconds: 15
    periodSeconds: 10
    successThreshold: 1
    timeoutSeconds: 10
  name: virt-controller
  ports:
    - containerPort: 8443
      name: metrics
      protocol: TCP
  readinessProbe:
    failureThreshold: 3
    httpGet:
      path: /leader
      port: 8443
      scheme: HTTPS
    initialDelaySeconds: 15
    periodSeconds: 10
    successThreshold: 1
    timeoutSeconds: 10
  resources:
    requests:
      cpu: 10m
      memory: 150Mi
  terminationMessagePath: /dev/termination-log
  terminationMessagePolicy: File
  volumeMounts:
    - mountPath: /etc/virt-controller/certificates
      name: kubevirt-controller-certs
      readOnly: true
    - mountPath: /profile-data
      name: profile-data
And if I use my custom image, it works very well. The only time it causes an issue is while rolling back to the same image registry.suse.com/suse/sles/15.4/virt-controller:0.54.0-150400.3.7.1; one of the pods is in a CrashLoopBackOff state.
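(Worth noting here: with only one replica crash-looping, it helps to see which node it was scheduled on; the label selector is an assumption based on stock KubeVirt labels.)
# The NODE column shows where each replica landed; the crash-looping one is the interesting node:
kubectl -n harvester-system get pods -l kubevirt.io=virt-controller -o wide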
s
Interesting… I will try to reproduce this on my v1.1.1 cluster.
Hi @powerful-soccer-11224, I tried the following steps. Could you check again what I missed?
1. Install a one-node Harvester v1.1.1.
2. Patch the virt-controller by adding the following patch (the main change is the KubeVirt image version, from 0.54.0-150400.3.7.1 to 0.54.0-150400.3.19.1):
Copy code
- patch: '{"spec":{"template":{"spec":{"containers":[{"name":"virt-controller",
        "args":["--launcher-image","registry.suse.com/suse/sles/15.4/virt-launcher:0.54.0-150400.3.19.1","--port","8443",
        "-v", "2"],"image":"registry.suse.com/suse/sles/15.4/virt-controller:0.54.0-150400.3.19.1","imagePullPolicy":"Always"}]}}}}'
      resourceName: virt-controller
      resourceType: Deployment
      type: strategic
3. Check that the virt-controller deployment contains the above patch.
4. Check that the virt-controller pod is running normally.
5. Remove the above patch.
6. Check that the virt-controller deployment is back to the default config.
7. Check that the virt-controller pod is running normally.
I did not encounter any errors.
Also, could you provide the md5 of the virt-controller binary that ran into the error?
@powerful-soccer-11224 ^^^
And, could you check the kernel log to see if there are any errors? Thanks!
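(One possible way to get that md5 without relying on the crash-looping container: run a throwaway pod pinned to the failing node with the same image. Everything in angle brackets is a placeholder, and this is only a sketch; if md5sum itself hits the same exec format error, that is informative too.)
# Reuses the image already cached on <failing-node> (imagePullPolicy defaults to IfNotPresent for a tagged image):
kubectl run virt-controller-md5 --rm -it --restart=Never \
  --image=registry.suse.com/suse/sles/15.4/virt-controller:0.54.0-150400.3.7.1 \
  --overrides='{"apiVersion":"v1","spec":{"nodeName":"<failing-node>"}}' \
  --command -- md5sum /usr/bin/virt-controller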
p
Hi @salmon-city-57654, sorry for the late reply. I was on leave yesterday. I tried the above steps again and things seem to be working fine. The problem was with one of the nodes in Harvester, where I am encountering the error: exec /usr/bin/virt-controller: exec format error. I am still not able to find out why it works on the remaining 2 nodes and only fails on this node, even though the configuration is the same for all nodes.
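(Since the failure is node-specific, one sketched comparison that can help: the image ID each node actually cached; the label selector is again an assumption.)
# A node that cached a different or corrupted copy of the image will stand out in the IMAGEID column:
kubectl -n harvester-system get pods -l kubevirt.io=virt-controller \
  -o custom-columns=POD:.metadata.name,NODE:.spec.nodeName,IMAGEID:.status.containerStatuses[0].imageID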
s
Hi @powerful-soccer-11224, could you try the workaround mentioned in https://github.com/harvester/harvester/issues/4434, in the Additional context section?
p
@salmon-city-57654 I followed the steps mentioned in the workaround and it seems to be working. Thanks a ton for all your help 🙂
👍 1