davmacario / MDI-LLM

Implementation of Model-Distributed Inference for Large Language Models, built on top of LitGPT
MIT License

Add Llama training script #34

Closed: davmacario closed this 5 months ago

davmacario commented 5 months ago

Added support for defining and training any model with the Llama architecture.

It is also possible to resume training or fine-tune either local models or pre-trained ones (from the Hugging Face Hub).
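As a rough illustration of the three starting points described above (a new model, a local checkpoint, or a pre-trained model from the Hub), here is a minimal sketch using only the Python standard library. The names `LlamaStyleConfig` and `choose_init_mode`, the default hyperparameters, and the mode strings are all hypothetical and are not taken from this PR or from LitGPT; the actual script may be organized quite differently.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Optional


@dataclass
class LlamaStyleConfig:
    """Hypothetical hyperparameters for a Llama-style decoder (not the repo's class)."""

    n_layer: int = 12
    n_head: int = 12
    n_embd: int = 768
    block_size: int = 1024
    vocab_size: int = 32000

    def __post_init__(self) -> None:
        # Attention heads must split the embedding dimension evenly.
        if self.n_embd % self.n_head != 0:
            raise ValueError("n_embd must be divisible by n_head")

    @property
    def head_size(self) -> int:
        """Per-head dimension implied by n_embd and n_head."""
        return self.n_embd // self.n_head


def choose_init_mode(checkpoint_dir: Optional[Path], hub_model: Optional[str]) -> str:
    """Decide how training starts (hypothetical helper).

    Returns "resume" for a local checkpoint, "pretrained" for a Hub model
    identifier, and "scratch" when neither is given.
    """
    if checkpoint_dir is not None and hub_model is not None:
        raise ValueError("pass either a local checkpoint or a Hub model, not both")
    if checkpoint_dir is not None:
        return "resume"
    if hub_model is not None:
        return "pretrained"
    return "scratch"


if __name__ == "__main__":
    config = LlamaStyleConfig(n_layer=6, n_head=8, n_embd=512)
    print(config.head_size)
    print(choose_init_mode(None, None))
    print(choose_init_mode(Path("checkpoints/my-model"), None))
```

Keeping the choice of starting point in one small function makes the mutually exclusive options explicit, so an ambiguous invocation fails early instead of silently preferring one source of weights.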