infinigence/LVEval
Repository of LV-Eval Benchmark
MIT License
46 stars, 5 forks
issues
Why are all question-answer pairs in factrecall_en the same?
#5
yhshu
opened
3 weeks ago
0
The BAMBOO Benchmark has no Chinese dataset
#4
YanShuang17
opened
3 weeks ago
0
Updated Benchmark Results, like GPT-4o, LLaMA 3.1, and Qwen 2
#3
rgtjf
opened
2 months ago
1
Why is only the first answer scored when there are multiple answers?
#2
RubickH
opened
3 months ago
2
About the Multi-hop QA datasets in the paper
#1
cobraheleah
opened
7 months ago
1