issues
search
haozheji
/
exact-optimization
ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment
https://arxiv.org/abs/2402.00856
MIT License
44
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Calculating reward model win rates to reproduce experiments
#5
sdsfas12
closed
2 months ago
2
Could you please relase the model checkponts?
#4
AGTSAAA
opened
4 months ago
1
Thank you for your work
#3
AGTSAAA
closed
4 months ago
1
The DPO loss implementation seems incomplete
#2
peterjc123
closed
5 months ago
8
code release
#1
Michelleable
closed
6 months ago
1