okuvshynov / slowllama

Finetune llama2-70b and codellama on MacBook Air without quantization
MIT License
448 stars 34 forks source link

Slow service #2

Closed okuvshynov closed 1 year ago