mit-han-lab / TinyChatEngine

TinyChatEngine: On-Device LLM Inference Library
https://mit-han-lab.github.io/TinyChatEngine/
MIT License
647 stars 62 forks source link

LLaMA runtime support #1

Closed meenchen closed 1 year ago

meenchen commented 1 year ago

Adding FP32 reference implementation for LLaMA