Open zibberzoo opened 1 year ago
I have an Nvidia RTX 5000 that works flawlessly with docker under WSL2 on Windows 11 Pro. However, using the default docker-compose-13b.yml which is set to use Docker-cuda with 64 GPU layers, I don't see any GPU activity -- only high CPU usage (i7-9850H) and extremely slow results. Am I misinterpreting GPU support? Thanks. -Z
Same issue here, only difference is that I use a Quadro P5000 GPU.
nvtop even shows a running compute task correctly, it also fills the VRAM, it just has almost none to extremely little GPU-usage while the CPU runs at full throttle.
Same here. 4060ti w/16gb vram. No GPU use, just 100% cpu.
RTX 3070 same problem - zero GPU usage 100% multi core CPU usage, I am running 5700G so wondered if it was caused by the IGPU but it looks like I'm not alone and it's not just an AMD issue.
Same thing here with a p100, 100% cpu usage....
Using RTX 3080 on v1.0.6, CPU also 100%."
+1
Tesla P4 & Quadro P400 Log recognizes the P4, also says it's been set as the primary device. Zero GPU load, 100% CPU when generating responses.
same issue for Tesla T4
same issue here 3060 12GB
I had the same issue, tell I started to add more (Number Of GPU Layers) I started with 64, and started to see more GPU (Quadro RTX 4000) the CPU (AMD 5800x) usage and the LLM moved into my GPU ram. Now I'm at 1024 but 512 worked just as good for me. I'm running unraid. Note: it does still use CPU but nowhere near as long. Hope this Helps
I have an Nvidia RTX 5000 that works flawlessly with docker under WSL2 on Windows 11 Pro. However, using the default docker-compose-13b.yml which is set to use Docker-cuda with 64 GPU layers, I don't see any GPU activity -- only high CPU usage (i7-9850H) and extremely slow results. Am I misinterpreting GPU support? Thanks. -Z