-
### Describe the issue
When using SetIntraOpNumThreads(1) and SetIntraOpNumThreads(10) on GPU, the inference times are similar, both around 30 ms. I have already done a warm-up before calculating the…
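For reference, a minimal Python sketch of the setup being compared; the model path, input shape, and iteration counts are placeholders, and `intra_op_num_threads` is the Python-API equivalent of `SetIntraOpNumThreads`:

```python
import time
import numpy as np
import onnxruntime as ort

def timed_run(num_threads: int, model_path: str = "model.onnx") -> float:
    opts = ort.SessionOptions()
    opts.intra_op_num_threads = num_threads  # Python equivalent of SetIntraOpNumThreads
    sess = ort.InferenceSession(model_path, opts, providers=["CUDAExecutionProvider"])

    # Placeholder input; the real model's input name/shape come from the issue author's setup.
    feed = {sess.get_inputs()[0].name: np.random.rand(1, 3, 224, 224).astype(np.float32)}

    for _ in range(10):            # warm-up, as described in the issue
        sess.run(None, feed)

    start = time.perf_counter()
    for _ in range(100):
        sess.run(None, feed)
    return (time.perf_counter() - start) / 100 * 1000.0  # average ms per inference

print(timed_run(1), timed_run(10))
```

Note that `intra_op_num_threads` sizes the CPU operator thread pool; when the session runs on the `CUDAExecutionProvider`, most of the work happens on the GPU, which would explain why the two settings measure almost the same latency.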
-
### System Info
- CPU: INTEL RPL
- GPU Name: NVIDIA RTX 4090
- TensorRT-LLM: tensorrt_llm==0.11.0.dev2024060400
- Container Used: Yes (also reproduced in a Conda environment)
- Driver Version: 555.42.02
…
-
### Your current environment
The output of `python collect_env.py`
```text
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A…
-
### Your current environment
The output of `python collect_env.py`
```text
(pytorch_gpu) ➜ vllm git:(main) ✗ python collect_env.py
Collecting environment information...
WARNING 11-03 12:55:08 _c…
-
### Issue Type
Bug
### Source
binary
### Tensorflow Version
2.6-2.10
### Custom Code
Yes
### OS Platform and Distribution
Linux Fedora 36
### Mobil…
-
I followed your process to convert the engine, and I also trained a model myself, but I ran into this problem with both the original engine and the one I trained:
loading annotations into memory...
…
-
### Your current environment
The output of `python collect_env.py`
```text
Collecting environment information...
PyTorch version: 2.5.0+cu124
Is debug build: False
CUDA used to build PyTorch…
-
# [RFC] Aten Operators in Triton for Multi-backend support
## Abstract
This RFC discusses
1. the benefits and challenges of developing dispatch functions for Aten operators in Triton.
2. a…
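For illustration only (not part of the RFC text), a minimal sketch of what a Triton-backed implementation behind such a dispatch function could look like, using an elementwise add in the shape of `aten::add.Tensor`; the kernel, block size, and wrapper name are assumptions:

```python
import torch
import triton
import triton.language as tl

@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    # One program instance handles BLOCK_SIZE contiguous elements.
    pid = tl.program_id(axis=0)
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)

def triton_add(x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    # Candidate dispatch target for an aten::add.Tensor-style op (illustrative name).
    out = torch.empty_like(x)
    n_elements = out.numel()
    grid = lambda meta: (triton.cdiv(n_elements, meta["BLOCK_SIZE"]),)
    add_kernel[grid](x, y, out, n_elements, BLOCK_SIZE=1024)
    return out

# Usage (tensors must live on the GPU):
# x = torch.randn(4096, device="cuda"); y = torch.randn(4096, device="cuda")
# out = triton_add(x, y)
```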
-
### Your current environment
The output of `python collect_env.py`
```text
Collecting environment information...
WARNING 08-22 15:09:07 _custom_ops.py:14] Failed to import from vllm._C with Mo…
-
### Describe the issue
Hi, I use onnxruntime with IOBinding in Python; our model has 7 inputs. When I use OrtValue.ortvalue_from_numpy() to create the OrtValue, I find that the OrtValue.ptr() of …
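For context, a minimal IOBinding sketch along these lines; the model path, input shapes, and device placement are assumptions, and `data_ptr()` is used here to inspect each OrtValue's buffer address:

```python
import numpy as np
import onnxruntime as ort

# Placeholder model; the real one has 7 inputs per the issue description.
sess = ort.InferenceSession("model.onnx", providers=["CUDAExecutionProvider"])
binding = sess.io_binding()

for inp in sess.get_inputs():
    # Placeholder shape/dtype; the real values depend on the model.
    ov = ort.OrtValue.ortvalue_from_numpy(
        np.zeros((1, 128), dtype=np.float32), "cuda", 0)
    print(inp.name, hex(ov.data_ptr()))       # inspect the buffer address of each OrtValue
    binding.bind_ortvalue_input(inp.name, ov)

for out in sess.get_outputs():
    binding.bind_output(out.name, "cuda", 0)  # let ORT allocate the outputs on the GPU

sess.run_with_iobinding(binding)
results = binding.copy_outputs_to_cpu()
```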