KwanWaiChung/M4LE
Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
MIT License
22 stars
0 forks
issues
OOM issue with inference length
#8
Zhangchaoran000
opened
3 months ago
0
Why does the DuReader test set use the Acc metric instead of ROUGE?
#7
jarheadjoe
closed
3 months ago
2
Will you report the results used for plotting Figure 3?
#6
jarheadjoe
closed
11 months ago
1
will you release results/all_result.csv that are consistent with those in the paper's
#5
jarheadjoe
closed
3 months ago
3
inference resume fails
#4
jarheadjoe
closed
11 months ago
1
Two questions appear at the end of the drcd_semantic-single dataset
#3
jarheadjoe
closed
1 year ago
1
Will the data generation code be open-sourced?
#2
jarheadjoe
closed
3 months ago
2
Two "段落1" ("Paragraph 1") entries appear in the C3 dataset
#1
jarheadjoe
closed
1 year ago
1