nvidia Search Results - Githubissues

1000+ results
for nvidia

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

kubernetes-sigs/kueue #3215

Add resource limits into ResourceGroup of ClusterQueue/Cohor…

**What would you like to be added**: Maybe a `limits` field that can be added into `ResourceGroup` struct for `ClusterQueue` or the new `Cohort` CRD. Like this: ```yaml apiVersion: kueue…

FillZpp updated 1 month ago
5
rapidsai/docker #680

[BUG] WSL - No NVIDIA GPU detected but nvidia-smi works and …

**Describe the bug** 1. GPU device **not** found in running RapidsAI Docker container in WSL 2. `nvidia-smi` **can** see the device 2.1. in Windows 2.2. in WSL 2.3. from within the runnin…

adriantorrie updated 5 months ago
2
NVIDIA/DCGM #155

dcgm-exporter crashes hostengine.

Running a [`3.3.5-3.4.0` exporter ](https://github.com/NVIDIA/dcgm-exporter/releases/tag/3.3.5-3.4.0) on a 3.3.5 host-engine as shipped via nvidia-ubuntu-repos SEGFAULTs the Host-engine. Is there s…

krono updated 1 week ago
38
zed-industries/zed #17588

FlatConfig formatting not working with Vue files

### Check for existing issues - [X] Completed ### Describe the bug / provide steps to reproduce it Put simply, Zed doesn't respect the FlatConfig when formatting Vue files. Weirdly enough, it work…

SSebigo updated 2 weeks ago
4
huggingface/trl #2294

OOM when finetuning Llama3.2-90B on 8xA100 80GB

### System Info trl, transformers: most recent on github python 3.10.11 ubuntu 22 package versions: ``` accelerate==1.0.1 addict==2.4.0 aiohappyeyeballs==2.4.3 aiohttp==3.10.10 aiosignal…

maximilianmordig updated 1 day ago
1
NVIDIA/open-gpu-kernel-modules #472

Suspend doesn't work when PreserveVideoMemoryAllocations is …

### NVIDIA Open GPU Kernel Modules Version 525.85.05 ### Does this happen with the proprietary driver (of the same version) as well? Yes ### Operating System and Version Linux Mint 21.1…

Monsterovich updated 5 days ago
78
DDMAL/Rodan #1170

NVIDIA-SMI failed on vGPU instance

**Not related to local or staging** Same issue as in #1161 In short, we have ``` NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA …

homework36 updated 4 months ago
4
NVIDIA/nccl-tests #256

NCCL topology on the VM of H200

We have 2 H200 servers connected with the IP switch. We ran nccl_test and all_reduce_perf script worked well and had expected performance on the baremetal system. ``` fs@fs-207:~$ mpirun -np 16 -H 20…

wangjiafu0310 updated 1 month ago
4
vllm-project/vllm #9113

[Performance] In v0.6.2, when tp=1, TPOT becomes very slow f…

### Your current environment The output of `python collect_env.py` ```text Your output of `python collect_env.py` here ``` Defaulted container "kserve-container" out of: kserve-container,…

ashgold updated 3 weeks ago
3
ollama/ollama #7758

OLLAMA_MAX_QUEUE does not limit requests to the same model

### What is the issue? It seems that OLLAMA_MAX_QUEUE is not taking effect. My environment is Windows 11, and I have set OLLAMA_NUM_PARALLEL=1, set OLLAMA_MAX_QUEUE=1, but excessive requests are sti…

yyx1111 updated 1 day ago
1

上一页 1...93 94 95 96 97 98 99...100 下一页

1000+ results for nvidia

1000+ results
for nvidia