wzhouad/WPO
Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"
Other
21 stars, 0 forks
issues
#2 About dataset (lkevinzc, closed 1 month ago, 1 comment)
#1 How to calculate equation 2 efficiently? (peterjc123, opened 3 months ago, 4 comments)