issues
search
for-ai
/
m-rewardbench
Official Code for M-RᴇᴡᴀʀᴅBᴇɴᴄʜ: Evaluating Reward Models in Multilingual Settings
https://m-rewardbench.github.io/
MIT License
15
stars
2
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Fix link to dataset
#57
ljvmiranda921
closed
4 days ago
0
数据集在huggingface上找不到
#56
ljxzs-jjlzs
closed
4 days ago
1
Adding languages
#55
peregilk
opened
3 weeks ago
1
Add arXiv link
#54
ljvmiranda921
closed
1 month ago
0
Update README.md
#53
ljvmiranda921
closed
1 month ago
0
Add disagreement histogram
#52
ljvmiranda921
closed
1 month ago
0
Update plotting scripts
#51
ljvmiranda921
closed
1 month ago
0
Add histogram
#50
ljvmiranda921
closed
1 month ago
0
Add code for getting the histogram
#49
ljvmiranda921
closed
1 month ago
0
Add plot for NLLB vs Google Translate
#48
ljvmiranda921
closed
1 month ago
0
Update all charts
#47
ljvmiranda921
closed
1 month ago
0
Add code for MAPLE and for inner-annotator agreement
#46
ljvmiranda921
closed
1 month ago
0
Update plots and improve their setup
#45
ljvmiranda921
closed
1 month ago
0
Plot updates
#44
ljvmiranda921
closed
1 month ago
0
Update plot with some fixes
#43
ljvmiranda921
closed
1 month ago
0
Update plot_utils.py
#42
ljvmiranda921
closed
1 month ago
0
Fix not enclosed function
#41
ljvmiranda921
closed
1 month ago
0
Add new plots
#40
ljvmiranda921
closed
1 month ago
0
Update counts
#39
ljvmiranda921
closed
1 month ago
0
Add fix for rewardbench script
#38
ljvmiranda921
closed
1 month ago
0
Add English drop plots
#37
ljvmiranda921
closed
1 month ago
0
Add new plots
#36
ljvmiranda921
closed
1 month ago
0
Update analysis
#35
ljvmiranda921
closed
1 month ago
0
Added example command for translation
#34
RishabhMaheshwary
closed
2 months ago
0
Updated gtranslate with batch save and load
#33
RishabhMaheshwary
closed
2 months ago
0
Updated gtransalte to skip code and non-code text
#32
RishabhMaheshwary
closed
2 months ago
0
Optimize Open-Source transation pipeline
#31
ShayekhBinIslam
closed
1 month ago
0
Update README.md
#30
ljvmiranda921
closed
2 months ago
0
Update docs
#29
ljvmiranda921
closed
2 months ago
0
Add charts for english drop
#28
ljvmiranda921
closed
3 months ago
0
Add converters
#27
ljvmiranda921
closed
3 months ago
0
Add some charts
#26
ljvmiranda921
closed
3 months ago
0
Update agreement
#25
ljvmiranda921
closed
3 months ago
0
Google translate script
#24
RishabhMaheshwary
closed
3 months ago
1
Add --force_download for getting results
#23
ljvmiranda921
closed
3 months ago
0
Add link for checking agreement
#22
ljvmiranda921
closed
3 months ago
0
Create annotation check
#21
ljvmiranda921
closed
3 months ago
0
Gemma special case handled
#20
ShayekhBinIslam
closed
3 months ago
1
Add simple CLI for downloading and parsing results
#19
ljvmiranda921
closed
3 months ago
1
Adapted scripts for Llama3.1 and fixed generative flags.
#18
ShayekhBinIslam
closed
3 months ago
2
Support torch_bfloat16
#17
ljvmiranda921
closed
3 months ago
0
Few more bug fixes for the RM script
#16
ljvmiranda921
closed
3 months ago
0
Bug fixes
#15
ljvmiranda921
closed
3 months ago
0
Add rewardbench-full
#14
ljvmiranda921
closed
3 months ago
1
Add rewardbench copy
#13
ljvmiranda921
closed
3 months ago
0
Update evals pipeline to accommodate official multilingual eval set
#12
ljvmiranda921
closed
3 months ago
1
Quick updates on documentation and dependencies
#11
ljvmiranda921
closed
4 months ago
0
Translate preference pairs
#10
RishabhMaheshwary
closed
4 months ago
1
Include language in the prompt and some new features
#9
ljvmiranda921
closed
4 months ago
1
Add Generative RM pipeline
#8
ljvmiranda921
closed
4 months ago
2
Next