# rancher-desktop
w
yeah it will be a bit more complicated, as i believe CUDA is built against glibc, and Alpine, which I thought is what RD uses for its container host, is musl-based. nvidia has resisted releasing a musl option, and jamming glibc into the host OS may cause other issues.
https://learn.microsoft.com/en-us/windows/ai/directml/gpu-cuda-in-wsl has more information on prereqs for running CUDA specifically.
q
One quick doubt: RD runs on K3s, and K3s is able to use the GPU capability. Is there any difference between the two?
w
for your specific needs i would say stick with Docker Desktop, as it would be cheaper than moving to ROCm, and i wouldn't hold your breath on nvidia supporting Alpine in the near term. Adding a Debian option to RD would also take a bit of time, so the easiest path if you need CUDA would be to stick with DD.
it would be the same container host, so are you sure it's actually hitting CUDA?
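easiest way to tell is to poke at the GPU from inside the container; a rough sketch (assumes the image has Python, and the torch part only applies if PyTorch happens to be installed):

```python
# Quick check of whether a CUDA-capable GPU is actually visible inside the
# container, instead of the workload silently falling back to CPU.
import shutil
import subprocess

if shutil.which("nvidia-smi"):
    # nvidia-smi lists the GPUs the container can see (if the runtime passed any through)
    subprocess.run(["nvidia-smi"], check=False)
else:
    print("nvidia-smi not found in this container")

try:
    import torch
    print("torch.cuda.is_available():", torch.cuda.is_available())
except ImportError:
    print("PyTorch not installed; skipping the CUDA runtime check")
```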
q
We have tested K3s on an edge server, not locally. RD we tried on a local machine that has a GPU.
w
yeah k3s is just an orchestrator?
the container host needs to support the kernel drivers, which is the problem. you have musl libc support and not glibc, and nvidia's drivers depend on glibc from what i know.
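if you want to confirm which libc the container host is built on, a quick sketch (gnu_get_libc_version() is glibc-only, so an AttributeError is a strong hint you're on musl):

```python
# Detect whether the running system's libc is glibc or something else (e.g. musl).
# gnu_get_libc_version() is a glibc-only symbol, so its absence suggests musl.
import ctypes

libc = ctypes.CDLL(None)  # load the C library already linked into this process
try:
    get_version = libc.gnu_get_libc_version
    get_version.restype = ctypes.c_char_p
    print("glibc", get_version().decode())
except AttributeError:
    print("not glibc -- likely musl (e.g. Alpine)")
```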
q
Okay, need to check on that.
Thank you @wide-mechanic-33041
w
you might be able to hack something in, but it will be super fragile, and performance and functionality will be different. I say "there be dragons", so keep DD around for these users: https://wiki.alpinelinux.org/wiki/NVIDIA
q
can test on one machine, need to check on it. Will the Open WebUI extension for RD also have the same issue?
The hack appears to be challenging since it involves working with drivers.
w
well, open webui is just an interface, right? what are you using for inference?
q
well, the team was checking via the Rancher Desktop UI.. other than that, VS Code to run the container that already has the GPU enabled
w
no, RD would be your orchestrator. what are you using to host the actual models used for inference?
many support CPU-based inference, which should work perfectly fine. AVX2 isn't a barn burner, but you can get OK prompt-processing (pp) and token-generation (tg) throughput
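for example, something like llama-cpp-python runs entirely on CPU; a rough sketch (assumes the package is installed and a GGUF model sits at the placeholder path):

```python
# Minimal CPU-only inference sketch with llama-cpp-python.
# "model.gguf" is a placeholder path -- point it at whatever quantized model you use.
from llama_cpp import Llama

llm = Llama(model_path="model.gguf", n_threads=8)  # CPU threads; no GPU offload requested
out = llm("Explain what musl libc is in one sentence.", max_tokens=64)
print(out["choices"][0]["text"])
```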
i know webui tends to recommend ollama given its adaptability, but it can use any OpenAI-compatible server
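so anything that speaks that API works; for instance (a sketch assuming Ollama is serving its OpenAI-compatible endpoint on the default port 11434 and a model named "llama3" has been pulled):

```python
# Talk to any OpenAI-compatible inference server; here pointed at a local Ollama.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:11434/v1",  # Ollama's OpenAI-compatible endpoint (default port assumed)
    api_key="ollama",  # Ollama ignores the key, but the client requires a non-empty string
)

resp = client.chat.completions.create(
    model="llama3",  # placeholder model name; use whatever you have pulled
    messages=[{"role": "user", "content": "Say hello"}],
)
print(resp.choices[0].message.content)
```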
q
I need to check with the DS team on this part.. the update I got is that it is a local desktop with a GPU. The interface part I will have to check with them.
w
well RD won't be able to use a local nvidia GPU because of the lack of musl-compatible drivers; that just isn't an option. so you will need to look at other options, like sticking with Docker Desktop where the moby VM is glibc, getting an AMD card which should support musl linking, or using CPU inference
technically you could also use external inference, but it sounded like you wanted this all to be local
f
The Open WebUI extension will be included in Rancher Desktop 1.17. It will run Ollama on the host, to get access to the GPU, and the rest of the app (webui, search) inside containers.
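For reference, a container-side piece could reach a host-side Ollama over its normal HTTP API roughly like this (a sketch assuming Ollama's default port 11434 and that the container can reach the host as host.docker.internal; the extension may wire this up differently):

```python
# Sketch of a container-side call to an Ollama instance running on the host.
# host.docker.internal is a common Docker convention for "the host"; the actual
# hostname depends on the container runtime's network setup.
import json
import urllib.request

req = urllib.request.Request(
    "http://host.docker.internal:11434/api/generate",  # Ollama's native generate endpoint
    data=json.dumps({
        "model": "llama3",   # placeholder; whichever model the host has pulled
        "prompt": "Say hello",
        "stream": False,     # ask for a single JSON response instead of a stream
    }).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read())["response"])
```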