Customizable loss function & inference step?

Lightning-AI / litgpt

Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.

https://lightning.ai

Apache License 2.0

6.85k stars 726 forks source link

Customizable loss function & inference step? #1388

Closed Boltzmachine closed 1 week ago

Boltzmachine commented 1 week ago

The high level of customization offered by PyTorch Lightning is why I opted for PyTorch Lightning to train LMs over HuggingFace's stupid Trainer. However, it seems that this library once again consolidates everything into single command lines, thereby sacrificing a lot flexibility. Wonder if there is a way to customize the loss function and the inference process.

rasbt commented 1 week ago

Thanks for the feedback! The repo originally started out as self-contained scripts, and then we gradually transitioned this to the command line interface you are seeing now because this usage is the easiest for most people (incl. non-coders).

But like you said, it would be nice to also offer other ways to use the code, and eventually, we may create a Python interface. Thanks for suggesting!

Boltzmachine commented 1 week ago

Thanks for your feedback! Lightning is the best deep-learning training framework. Really hope it could be even better than huggingface when training LMs.