replicate / cog-triton

A cog implementation of Nvidia's Triton server
Apache License 2.0
12 stars 0 forks source link

Update tensorrt-llm to v0.9.0 #33

Closed yorickvP closed 6 months ago

yorickvP commented 7 months ago

This changes:

We have to update the triton_model_repo files now. I've updated triton_templates from the new trtllm backend.