triton-inference-server / onnxruntime_backend

The Triton backend for the ONNX Runtime.
BSD 3-Clause "New" or "Revised" License

build: Add WAR for CUDA 12.5 build issue (#257) #258

Open rmccorm4 opened 3 months ago

rmccorm4 commented 3 months ago

Bringing this to the main branch as well, since the current main pipelines target CUDA 12.5.