tairov / llama2.mojo

Inference Llama 2 in one file of pure 🔥
https://www.modular.com/blog/community-spotlight-how-i-built-llama2-by-aydyn-tairov
MIT License
2.09k stars 139 forks source link

Add param to set threads #44

Closed tairov closed 10 months ago

tairov commented 10 months ago

Add param -j to set amount of threads for parallelize methods, keep default value = num_cores()