chenyuxin1999 / S-DPO

[NeurIPS 2024] The implementation of paper "On Softmax Direct Preference Optimization for Recommendation"
https://arxiv.org/abs/2406.09215
21 stars 2 forks source link