octoml / mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
https://mlc.ai/mlc-llm
Apache License 2.0
5 stars 8 forks source link

[FP8] Bring fp8 support to OLLM tracking branch #240

Closed csullivan closed 7 months ago

csullivan commented 7 months ago

Parent reference: https://github.com/octoml/ollm/pull/433 Sibling reference: https://github.com/octoml/tvm/pull/61