-
### 🐛 Describe the bug
# Description
I tried to export a PyTorch model using `torch.nn.GroupNorm` to Core ML with ExecuTorch, but encountered a `ValueError: Unsupported fx node aten_native_group_nor…
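
For context, a minimal sketch of the export step that produces this decomposition (module shape and names are illustrative; the ExecuTorch/Core ML lowering steps from the report are omitted):

```python
import torch
import torch.nn as nn

# Minimal module wrapping torch.nn.GroupNorm; shapes are illustrative.
class GroupNormModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.norm = nn.GroupNorm(num_groups=4, num_channels=16)

    def forward(self, x):
        return self.norm(x)

model = GroupNormModel().eval()
example_inputs = (torch.randn(1, 16, 32, 32),)

# torch.export decomposes GroupNorm into aten.native_group_norm, the node
# the Core ML partitioner reportedly rejects.
exported = torch.export.export(model, example_inputs)
print(exported.graph_module.graph)
```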
-
### 🐛 Describe the bug
When running a simple model that includes torch.nn.LayerNorm with DeepSpeed ZeRO-3, torch.compile, and [compiled_autograd](https://github.com/pytorch/tutorials/blob/main/interme…
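
A hedged minimal sketch of the reported setup, with the DeepSpeed ZeRO-3 wrapping omitted (the compiled-autograd flag follows the linked tutorial and its availability depends on the PyTorch version):

```python
import torch
import torch.nn as nn

# Toy model containing torch.nn.LayerNorm; in the report this would be
# wrapped by DeepSpeed ZeRO-3 (deepspeed.initialize), which is omitted here.
model = nn.Sequential(nn.Linear(16, 16), nn.LayerNorm(16))
x = torch.randn(4, 16)

# Capture the backward pass with compiled autograd, as in the linked tutorial.
torch._dynamo.config.compiled_autograd = True

@torch.compile
def train_step(model, x):
    loss = model(x).sum()
    loss.backward()
    return loss

print(train_step(model, x))
```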
-
### What happened?
`GGML_CUDA_ENABLE_UNIFIED_MEMORY` is documented as automatically swapping out VRAM under pressure, letting you run any model as long as it fits within available RAM…
-
### Describe the bug
The Graph/RecordReplay/usm_fill.cpp test has been observed to timeout in CUDA CI for unrelated changes. For example, see https://github.com/intel/llvm/pull/14985.
```
TIMEOUT…
-
I am having a lot of trouble with speaker diarization across many different platforms and models.
```toml
[target.'cfg(any(windows, target_os = "linux"))'.dependencies]
sherpa-rs = { version =…
-
### Your current environment
The output of `python collect_env.py`
```text
# For security purposes, please feel free to check the contents of collect_env.py before running it.
python collect_e…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [X] 3. Please note that if the bug-related iss…
-
The test `WaitAnyHostAndDeviceSemaphoresAndDeviceSignals` from `cuda_graph_semaphore_submission_test` seems to fail intermittently with
```
12/412 Test #66: iree/hal/drivers/cuda/cts/cuda_graph_sema…
-
### 🐛 Describe the bug
As mentioned in this [blog](https://dev-discuss.pytorch.org/t/higher-order-operators-2023-10/1565), HigherOrderOperator does not support graph breaks inside the input/output fun…
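
A minimal illustration of that limitation, assuming `torch.cond` as the HigherOrderOperator and a `print` call as the graph-break trigger (the exact error depends on the PyTorch version):

```python
import torch

def true_fn(x):
    # The print call forces a graph break, which is not allowed inside the
    # branch of a HigherOrderOperator such as torch.cond.
    print("inside true branch")
    return x.sin()

def false_fn(x):
    return x.cos()

@torch.compile(fullgraph=True)
def f(pred, x):
    # torch.cond is a HigherOrderOperator: both branch functions must trace
    # without graph breaks.
    return torch.cond(pred, true_fn, false_fn, (x,))

x = torch.randn(4)
# Expected to raise rather than fall back, because the graph break occurs
# inside the cond branch.
f(torch.tensor(True), x)
```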
-
### Name and Version
```
.\llama-cli.exe --version
ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no
ggml_cuda_init: found 2 CUDA devices:
Device 0: NVIDIA…