replicate / cog-triton

A cog implementation of Nvidia's Triton server
Apache License 2.0
11 stars 0 forks source link

Merge nvidia-*-cu12 python with nix's cudaPackages #31

Closed yorickvP closed 4 months ago

yorickvP commented 4 months ago

Tested and working. Pushed to cache: