-
# 🐛 Bug
```py
from vllm import LLM, SamplingParams

llm = LLM(model=model_dir, enforce_eager=True)
```
then the following error is raised:
```
File d:\my\env\python3.10.10\lib\site-packages\xformers\ops\fmha\_triton\splitk_kernels.…
```
-
Hi! Thanks for your amazing work. I tested several pipelines and the speed of this framework is truly impressive🔥
However, I encountered an issue when using the stable-fast setting with the `e…
-
### 🐛 Describe the bug
After quantizing the resnet50_clip.openai model with torch.ao quantization, the last step, `exir.to_edge()`, fails quite often — not only with this model but with many others:
`…
-
Hi. I'm wondering: when I set cuda graph to false and compare against the pipeline without the stable_fast node, why is the latter faster?
This is with cuda graph set to false:
![image](https://github.com/gameltb/…
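One thing worth checking in comparisons like this is warm-up: CUDA graph capture and stable_fast's first-call optimization both front-load work, so timing the first iterations skews the result. Below is a minimal, framework-agnostic timing-harness sketch (pure Python; the measured workload is a trivial stand-in, not the actual pipeline):

```python
import time

def benchmark(fn, warmup=3, iters=10):
    """Time a callable after warm-up runs, returning mean seconds per iteration.

    Warm-up matters here: one-time costs (e.g. CUDA graph capture or a
    framework's first-call compilation) would otherwise be charged to the
    first measured iteration and distort the comparison.
    """
    for _ in range(warmup):
        fn()
    start = time.perf_counter()
    for _ in range(iters):
        fn()
    return (time.perf_counter() - start) / iters

# Trivial stand-in workload for illustration only:
mean_s = benchmark(lambda: sum(i * i for i in range(10_000)))
print(f"{mean_s * 1e3:.3f} ms/iter")
```

Running both configurations through a harness like this (same warm-up, same iteration count) makes the numbers comparable; note that for real GPU workloads you would also need to synchronize the device before reading the clock.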
-
Once https://github.com/JuliaGPU/CUDA.jl/pull/1380 is merged, we should be able to have first-class support for `graph_type=:sparse`. Therefore, PR #66 should be revamped with the latest CUDA updates.
…
-
*Note*: If you have a model or program that is not supported yet but should be, please use the program coverage template.
## 🐛 Bug
```py
import thunder
import torch
def func(x):
    return…
-
### Describe the bug
The Graph/RecordReplay/usm_fill.cpp test has been observed to time out in CUDA CI for unrelated changes. For example, see https://github.com/intel/llvm/pull/14985.
```
TIMEOUT…
-
### Describe the issue
I'm using onnx-tensorrt.
When I enable `trt_cuda_graph_enable` like this:
![image](https://github.com/microsoft/onnxruntime/assets/67405690/0f239de5-f995-43df-aa8a-805674…
-
## 🐛 Bug
![garbled output](https://github.com/user-attachments/assets/4f446294-a903-412d-ad98-987d0f04a60a)
## To Reproduce
Steps to reproduce the behavior:
1. Compile:
mlc_llm compile /path/to/internl…
-
PyTorch now has some support for representing variable-length (varlen) sequences, and HF supports it to some extent:
- https://medium.com/pytorch/bettertransformer-out-of-the-box-performance-for-huggingface-transfor…
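For readers unfamiliar with the varlen layout: the common trick (used, e.g., by flash-attention's varlen kernels) is to concatenate sequences of different lengths into one flat buffer and track the boundaries with cumulative offsets (the `cu_seqlens` convention), avoiding padding entirely. A minimal pure-Python sketch of the idea — function names here are illustrative, not PyTorch's API:

```python
from itertools import accumulate

def pack_varlen(seqs):
    """Pack variable-length sequences into one flat list plus
    cumulative-offset boundaries (the `cu_seqlens` convention)."""
    flat = [tok for seq in seqs for tok in seq]
    # cu_seqlens[i] marks where sequence i starts; the last entry
    # is the total packed length.
    cu_seqlens = [0] + list(accumulate(len(s) for s in seqs))
    return flat, cu_seqlens

def unpack_varlen(flat, cu_seqlens):
    """Recover the original sequences from the packed layout."""
    return [flat[a:b] for a, b in zip(cu_seqlens, cu_seqlens[1:])]

seqs = [[1, 2], [3, 4, 5], [6]]
flat, cu = pack_varlen(seqs)
print(flat)  # [1, 2, 3, 4, 5, 6]
print(cu)    # [0, 2, 5, 6]
assert unpack_varlen(flat, cu) == seqs
```

PyTorch's nested tensors and the attention kernels that consume `cu_seqlens` build on the same packed representation, just with tensors instead of lists.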