mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
https://mlabonne.github.io/blog/
Apache License 2.0
39.18k stars 4.14k forks source link

DPO with Axolotl #48

Open JohanWork opened 9 months ago

JohanWork commented 9 months ago

It is possible to perform DPO with Axolotl. If I were to create a notebook for DPO fine-tuning, do you think it would be suitable for your repository?

mlabonne commented 9 months ago

Hi Johan, thanks for your suggestion. I actually made one, will release it soon. Feel free to suggest improvements if you're interested, it's not perfect but it works haha.

On Thu, Feb 15, 2024, 20:09 JohanWork @.***> wrote:

It is possible to perform DPO with Axolotl. If I were to create a notebook for DPO fine-tuning, do you think it would be suitable for your repository?

— Reply to this email directly, view it on GitHub https://github.com/mlabonne/llm-course/issues/48, or unsubscribe https://github.com/notifications/unsubscribe-auth/ATL5EGUCTEEUJUIC2BRK62LYTZTOPAVCNFSM6AAAAABDK4LN4WVHI2DSMVQWIX3LMV43ASLTON2WKOZSGEZTOMZVHA2TSNI . You are receiving this because you are subscribed to this thread.Message ID: @.***>

JohanWork commented 9 months ago

aa nice, looking forward to it. Will do!

mlabonne commented 9 months ago

Released it here: https://twitter.com/maximelabonne/status/1759222499131199788 :)