replicate / cog-triton

A cog implementation of Nvidia's Triton server
Apache License 2.0

Update tensorrt-llm to v0.9.0 #33

Closed: yorickvP closed this 3 months ago

yorickvP commented 4 months ago

This changes:

We have to update the triton_model_repo files now. I've updated triton_templates from the new trtllm backend.
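
For anyone regenerating the triton_model_repo files from the templates, here is a minimal sketch of the substitution step. It assumes the templates use `${name}` placeholders in the style of the upstream backend's config.pbtxt files. The template excerpt, the placeholder names, and the helper functions below are illustrative and not taken from this PR.

```python
import re
from string import Template
from typing import List, Mapping

# Hypothetical excerpt in the style of a config.pbtxt template (not from this PR).
TEMPLATE_TEXT = """\
name: "tensorrt_llm"
max_batch_size: ${triton_max_batch_size}
parameters: {
  key: "gpt_model_path"
  value: { string_value: "${engine_dir}" }
}
"""


def fill_template(text: str, values: Mapping[str, str]) -> str:
    """Substitute ${name} placeholders; names missing from `values` are left as-is."""
    return Template(text).safe_substitute(values)


def unfilled_placeholders(text: str) -> List[str]:
    """List placeholder names that still appear in the text, in order."""
    return re.findall(r"\$\{(\w+)\}", text)


if __name__ == "__main__":
    filled = fill_template(TEMPLATE_TEXT, {"triton_max_batch_size": "8"})
    print(filled)
    # Anything reported here still needs a value before the config is usable.
    print("still unfilled:", unfilled_placeholders(filled))
```

Checking for leftover placeholders after a template update is a cheap way to spot parameters that the new backend version added and the existing fill step does not yet supply.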