tairov / llama2.mojo

Inference Llama 2 in one file of pure 🔥
https://www.modular.com/blog/community-spotlight-how-i-built-llama2-by-aydyn-tairov
MIT License
2.09k stars 139 forks source link

How to do inference on GPUs #87

Closed toutouya closed 4 months ago

tairov commented 4 months ago

Hi @toutouya , unfortunately GPU inference is not yet supported by Mojo. We hope it get support soon