cuda-runtime-api Search Results

1000+ results
for cuda-runtime-api

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

cp2k/dbcsr #776

CUDA RUNTIME API error: DeviceSetLimit failed with error cud…

This PR seems to cause: > CUDA RUNTIME API error: DeviceSetLimit failed with error cudaErrorInvalidValue. ( tested on H100 device ) _Originally posted by @hfp in https://githu…

hfp updated 7 months ago
6
microsoft/onnxruntime #21690

onnxruntime.capi.onnxruntime_pybind11_state.Fail: [ONNXRunti…

### Describe the issue onnxruntime.capi.onnxruntime_pybind11_state.Fail: [ONNXRuntimeError] : 1 : FAIL : CopyTensorAsync is not implemented ### To reproduce build from source, cuda 12.3 ### Urgenc…

DefTruth updated 2 months ago
4
microsoft/onnxruntime #22212

CUDA Error cudaErrorUnsupportedPtxVersion with NVIDIA H800 (…

### Describe the issue ### Description: I'm encountering a CUDA error when attempting to execute a training process using ONNXRuntime GPU version 1.19.2 on a system with an NVIDIA H800 GPU (Comp…

xh-liu-tech updated 1 month ago
3
DeepVAC/libdeepvac #38

example 里带了cuda头文件，在纯CPU环境下编译不通过

RT，没有安装CUDA的机器，设置USE_CUDA=OFF, BUILD_ALL_EXAMPLES = OFF，因为仍然还是会编example，编译时报找不到头文件错误，具体： ``` In file included from /root/installs/libdeepvac/examples/src/test_resnet_benchmark.cpp:11: /usr/libtorch…

MoyanZitto updated 3 months ago
2
vllm-project/vllm #3338

vllm.engine.async_llm_engine.AsyncEngineDeadError: Task fini…

afol-apiserver-72b-1 | (RayWorkerVllm pid=3779) [E ProcessGroupNCCL.cpp:475] [Rank 1] Watchdog caught collective operation timeout: WorkNCCL(SeqNum=16487777, OpType=ALLREDUCE, NumelIn=195911680, Nume…

EchoShoot updated 4 weeks ago
1
pytorch/pytorch #134978

[RFC] A device-agnostic Python device memory related API des…

# Motivation This RFC aims to propose a design for a series of generic memory-related APIs tailored for stream-based accelerators to help users simplify the runtime code written for different devices…

guangyey updated 1 month ago
3
microsoft/onnxruntime #21651

Java GPU dependency of ONNX Runtime version 1.18 only suppor…

### Describe the issue I found that the Java dependency of onnxruntime-gpu 1.18.0 does not work properly on CUDA 11. Is there a parameter that can allow it to run correctly on CUDA 11? If not, could …

dongfeng3692 updated 2 months ago
1
NVIDIA/cutlass #1889

how to use nvrtc to run a sm90 kernel

Hi I want to use nvrtc to compile a sm90 kernel in runtime. The problem is that I don't have the kernel instance on host thus can't run to_underlying_arguments to get kernel param to launch the kerne…

mengchihe updated 5 days ago
10
microsoft/onnxruntime #20398

[Performance] Profiling on CUDA shows confusing values

### Describe the issue I tried profiling my UNET running on CUDA. And while looking at it in Perfetto, I saw that `SequentialExecutor::Execute` actually took only a fraction of the whole `model_run` …

Dr-Electron updated 2 weeks ago
3
microsoft/onnxruntime #22778

[Performance] Multiple instances of the same model are slowe…

### Describe the issue Running a model for N iterations in a single ONNX session is way faster than running the same model in 2 independent sessions, each session is run for N/2 iterations each. ¿W…

JorgeRuizITCL updated 2 weeks ago
1

上一页 1...8 9 10 11 12 13 14...100 下一页

1000+ results for cuda-runtime-api

1000+ results
for cuda-runtime-api