jiahe7ay / MINI_LLM

This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.
327 stars 52 forks source link

添加DPO 相关代码 #24

Closed wtxfrancise closed 4 months ago

wtxfrancise commented 4 months ago

1.添加DPO相关代码 2.修改readme文件和对应图片