QwenLM / Qwen2.5-Coder

Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.
2.94k stars 192 forks source link

Add DPO training #149

Closed yibomiao closed 2 weeks ago

yibomiao commented 2 weeks ago

Add DPO training