juncongmoo / pyllama

LLaMA: Open and Efficient Foundation Language Models
GNU General Public License v3.0
2.8k stars 312 forks source link

How to run 13B model in a single GPU just by inference.by? #84

Open statyui opened 1 year ago