# k3s
c
Did you add nvidia runtimeClass resources to the cluster, and set the runtimeClassName for your pod?
You must still add a RuntimeClass definition to your cluster, and deploy Pods that explicitly request the appropriate runtime by setting `runtimeClassName: nvidia` in the Pod spec.
Just because the runtimes are there doesn't mean your Pods are using them. You have to specifically ask for it.
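For reference, a minimal sketch of what that looks like, assuming the containerd runtime handler is registered under the name `nvidia` (which is how k3s registers it when it detects the NVIDIA container runtime); the Pod name and image are just illustrative:
```yaml
# RuntimeClass mapping the name "nvidia" to the nvidia containerd runtime handler
apiVersion: node.k8s.io/v1
kind: RuntimeClass
metadata:
  name: nvidia
handler: nvidia
---
# Pod that explicitly requests that runtime via runtimeClassName
apiVersion: v1
kind: Pod
metadata:
  name: gpu-test            # hypothetical name
spec:
  runtimeClassName: nvidia  # without this, the Pod runs under the default runtime
  containers:
  - name: cuda
    image: nvcr.io/nvidia/cuda:12.3.1-base-ubuntu22.04  # illustrative image/tag
    command: ["nvidia-smi"]
    resources:
      limits:
        nvidia.com/gpu: 1
```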
w
So if I want the NVIDIA Device Plugin pods to run correctly, then I must set `runtimeClassName: nvidia` via a configmap (or otherwise) in the Helm install?
The Helm cmdline is:
```sh
# helm upgrade -i nvdp nvdp/nvidia-device-plugin \
    --version=0.14.3 \
    --namespace nvidia-device-plugin \
    --create-namespace \
    --set-file config.map.nvdp-config=/root/nvdp-config.yaml
```
with the configmap currently as:
```yaml
# cat nvdp-config.yaml
version: v1
flags:
  migStrategy: "none"
  failOnInitError: true
  nvidiaDriverRoot: "/"
  plugin:
    passDeviceSpecs: false
    deviceListStrategy: envvar
    deviceIDStrategy: uuid
```
We did add the RuntimeClass to the cluster, like so:
c
I’m not sure how to set the runtimeClassName via that chart, but I think you’re on the right path
w
FYI @creamy-pencil-82913 -- `--set runtimeClassName=nvidia` does the trick 🙂
(they support alternate runtimes in their Helm chart thankfully)
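So, putting it together, the full install presumably becomes the same command as above with that one extra value set:
```sh
# helm upgrade -i nvdp nvdp/nvidia-device-plugin \
    --version=0.14.3 \
    --namespace nvidia-device-plugin \
    --create-namespace \
    --set runtimeClassName=nvidia \
    --set-file config.map.nvdp-config=/root/nvdp-config.yaml
```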