huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences
https://huggingface.co/HuggingFaceH4
Apache License 2.0

Released model weights for ablations of KTO/IPO/DPO cannot be found #168

Open ChenDRAG opened 4 months ago

ChenDRAG commented 4 months ago

Hi @edbeeching, thanks for the great work ablating the KTO/IPO/DPO algorithms in #104. I noticed that the referenced blog post says the best-performing model for each algorithm has been uploaded to the collection page. However, I cannot find these models.

Could you kindly provide these model weights? Thank you in advance.