Add Llama training script

davmacario / MDI-LLM

Implementation of Model-Distributed Inference for Large Language Models, built on top of LitGPT

MIT License

3 stars 2 forks source link

Closed davmacario closed 5 months ago

davmacario commented 5 months ago

Added the possibility of defining and training any model with Llama architecture.

It is also possible to resume training/finetune both local models or pre-trained ones (from the Huggingface Hub).