okuvshynov / slowllama

Finetune llama2-70b and codellama on MacBook Air without quantization
MIT License

Fp16 #4

Closed okuvshynov closed 9 months ago

okuvshynov commented 9 months ago
  1. Extract parameters to conf files.
  2. Support fp16 for storage and evaluation.
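
As a rough illustration of the storage half of item 2, here is a minimal sketch using only the Python standard library. It is not the code from this PR: `CONF`, `pack_weights`, and `unpack_weights` are hypothetical names, and the `storage_dtype` key only stands in for the kind of parameter item 1 moves into a conf file. It shows that storing values as fp16 takes half the bytes of fp32, at the cost of rounding for values that fp16 cannot represent exactly.

```python
import struct

# Hypothetical setting, standing in for a value extracted into a conf file.
CONF = {"storage_dtype": "fp16"}

# struct format codes: "e" is IEEE 754 half precision, "f" is single precision.
_FORMATS = {"fp16": "e", "fp32": "f"}


def pack_weights(values, dtype):
    """Serialize a list of floats at the requested precision (little-endian)."""
    return struct.pack(f"<{len(values)}{_FORMATS[dtype]}", *values)


def unpack_weights(blob, dtype):
    """Read the values back; struct returns them as ordinary Python floats."""
    code = _FORMATS[dtype]
    count = len(blob) // struct.calcsize("<" + code)
    return list(struct.unpack(f"<{count}{code}", blob))


weights = [0.1, -1.5, 3.0, 0.333]
blob16 = pack_weights(weights, CONF["storage_dtype"])
blob32 = pack_weights(weights, "fp32")
restored = unpack_weights(blob16, CONF["storage_dtype"])

print(len(blob16), len(blob32))  # fp16 uses 2 bytes per value, fp32 uses 4
print(restored)                  # -1.5 and 3.0 survive exactly; 0.1 and 0.333 are rounded
```

Evaluating in fp16 is a separate concern from storing in fp16 and is not covered by this sketch.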