basetenlabs / truss

The simplest way to serve AI/ML models in production
https://truss.baseten.co
MIT License
892 stars 64 forks source link

change gemm default to auto #1019

Closed joostinyi closed 3 months ago

joostinyi commented 3 months ago

:rocket: What

TRT-LLM 0.11.0 changed this default so we are following suit

:computer: How

:microscope: Testing