issues
search
gomate-community
/
rageval
Evaluation tools for Retrieval-augmented Generation (RAG) methods.
Apache License 2.0
81
stars
8
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Feature/batch compute
#110
bugtig6351
opened
2 days ago
2
add bert score
#109
bugtig6351
closed
4 days ago
1
Add distinct_1/2 metric
#108
moshesbeta
opened
5 days ago
0
Add distinct 1/2 metric
#107
moshesbeta
closed
5 days ago
0
Add metrics implementation
#106
moshesbeta
closed
5 days ago
1
Add hotpot qa
#105
LittleSunshineQi
closed
6 days ago
1
change tests units
#104
moshesbeta
closed
1 month ago
1
change the inputs of metrics and the calling methods in tests
#103
bugtig6351
closed
1 week ago
3
add webglm benchmark
#102
QianHaosheng
closed
1 month ago
0
add summary and snippet results
#101
QianHaosheng
closed
1 month ago
1
add alce asqa results
#100
QianHaosheng
closed
2 months ago
0
Feature/add benchmark results
#99
bugtig6351
closed
2 months ago
1
Feature/add benchmark results
#98
bugtig6351
closed
2 months ago
1
add llama2-7b-chat and solar-10.7b-instruct result
#97
bugtig6351
closed
2 months ago
1
add hotpotqa benchmark
#96
LittleSunshineQi
closed
2 months ago
1
add alce asqa benchmark
#95
QianHaosheng
closed
2 months ago
2
add generator metrics
#94
QianHaosheng
closed
2 months ago
1
add alce eli5 benchmark readme
#93
QianHaosheng
closed
2 months ago
0
modify eli5 benchmark
#92
QianHaosheng
closed
2 months ago
1
Add support for Chinese language evaluation
#91
Wenshansilvia
opened
3 months ago
0
move instructGPT to OpenAILLM
#90
bugtig6351
closed
3 months ago
1
update asqa readme
#89
bugtig6351
closed
3 months ago
1
fix gt_type
#88
henan991201
closed
3 months ago
1
add asqa benchmark
#87
bugtig6351
closed
3 months ago
1
Feature/add asqa benchmark
#86
bugtig6351
closed
3 months ago
0
add readme
#85
QianHaosheng
closed
3 months ago
1
update f1 readme
#84
henan991201
closed
3 months ago
1
Add embedding based ranking benchmark
#83
faneshion
opened
3 months ago
0
update readme
#82
henan991201
closed
3 months ago
1
supplement docs
#81
QianHaosheng
closed
3 months ago
0
fix a bug of bleu
#80
FBzzh
closed
3 months ago
0
add auto benchmark
#79
bugtig6351
closed
3 months ago
0
add alce eli5 benchmark
#78
QianHaosheng
closed
3 months ago
1
Feature/add openai model config
#77
bugtig6351
closed
3 months ago
0
Feature/add f1 ter chrf++
#76
henan991201
closed
3 months ago
2
OpenAILLM
#75
bugtig6351
closed
3 months ago
1
add answer_bleu.py
#74
FBzzh
closed
3 months ago
0
update readme
#73
faneshion
closed
4 months ago
0
add definition of task and meteric
#72
Wenshansilvia
closed
4 months ago
0
Data quality metrics
#71
faneshion
opened
4 months ago
0
Add auto_benchmak.py in benchmarks
#70
faneshion
closed
3 months ago
0
add disambig f1 score
#69
bugtig6351
closed
3 months ago
0
add citation metric
#68
QianHaosheng
closed
3 months ago
0
Hotfix/validate data refactor
#67
bugtig6351
closed
4 months ago
0
remove outdated comment
#66
bugtig6351
closed
4 months ago
0
hotfix/delete_additional_line
#65
Wenshansilvia
closed
4 months ago
0
ASQA dataset from huggingface: https://huggingface.co/datasets/din0s/asqa
#64
FBzzh
closed
4 months ago
0
List all potential test benchmarks
#63
faneshion
opened
4 months ago
3
Add ELI5 test benchmark
#62
faneshion
closed
3 months ago
1
init task
#61
Wenshansilvia
closed
4 months ago
1
Next