issues
search
opea-project
/
GenAIEval
Evaluation, benchmark, and scorecard, targeting for performance on throughput and latency, accuracy on popular evaluation harness, safety, and hallucination
Apache License 2.0
17
stars
28
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Add benchmark part into top README
#127
lvliang-intel
opened
12 hours ago
0
update rag_eval readme
#126
daisy-ycguo
closed
18 hours ago
0
fix bigcode version when python>=3.11
#125
lkk12014402
closed
1 day ago
0
doc: fix JSON example
#124
dbkinder
closed
18 hours ago
2
benchmark in v0.9 stopped supporting k3s
#123
ksandowi
opened
2 days ago
0
doc: fix JSON example
#122
dbkinder
closed
2 days ago
3
add acc tuning script.
#121
lkk12014402
closed
2 days ago
0
enable namespace setting in evals/benchmark tool
#120
daisy-ycguo
opened
4 days ago
0
Add VisualQnA evaluation support
#119
lvliang-intel
opened
5 days ago
0
doc: add policy validation/removal to the platform optimization
#118
askervin
closed
2 days ago
1
add audioqna asr wer eval scripts
#117
Spycsh
closed
4 days ago
0
[ChatQnA] Support the replica tuning for ChatQnA
#116
Zhenzhong1
closed
3 days ago
1
Fix ragas error
#115
lkk12014402
closed
1 week ago
0
update llm-as-judge doc.
#114
lkk12014402
closed
1 week ago
0
[v1.0] Add docker metric support
#113
bjzhjing
closed
1 week ago
0
Update CODEOWNERS
#112
kevinintel
closed
2 days ago
0
fix issue because of ragas changes
#111
lkk12014402
closed
1 week ago
0
Add README for codegen acc test.
#110
yao531441
closed
1 week ago
0
doc: fix headings and indents
#109
dbkinder
closed
1 week ago
0
doc: add title to new FaqGen README
#108
dbkinder
closed
1 week ago
1
Fix rerank benchmark script
#107
gavinlichn
closed
1 week ago
0
Add code owner
#106
gavinlichn
closed
1 week ago
2
Support service-list for metrics collection in benchmark.py
#105
joshuayao
closed
1 week ago
0
Support benchmark file for w/o rerank pipeline
#104
letonghan
closed
1 week ago
0
Update configuration in benchmark README
#103
daisy-ycguo
closed
1 week ago
0
Metrics files are not dumped if enabling "service-metric-collect" when using benchmark.py
#102
joshuayao
closed
1 week ago
1
No guide for using benchmark tool behind proxy
#101
joshuayao
opened
1 week ago
0
benchmark tools support metrics for Docker deployment
#100
xiguiw
opened
1 week ago
2
Failed to install dependencies of OPEA benchmark tool if Python version is higher than 3.10
#99
joshuayao
opened
1 week ago
2
Update chatqna input to fix input length
#98
letonghan
closed
1 week ago
0
Missing dependencies in evals/benchmark/stresscli/requirements.txt
#97
joshuayao
opened
1 week ago
0
add code owners
#96
kevinintel
closed
1 week ago
0
Support microservice level benchmark
#95
lvliang-intel
closed
1 week ago
0
Support bigcode eval for codegen v0.1
#94
yao531441
closed
1 week ago
2
Support P50, P90, P99 for next token latency
#93
lvliang-intel
closed
1 week ago
0
Missing initial instructions in evals/benchmark/README (and others)
#92
dbkinder
opened
2 weeks ago
0
Add FaqGen Accuracy scripts & Refine Ragas
#91
XinyaoWa
closed
1 week ago
2
doc: fix heading level
#90
dbkinder
closed
1 week ago
0
Align llm microservice parameters with end to end test
#89
lvliang-intel
closed
2 weeks ago
0
Add CRAG benchmark
#88
minmin-intel
closed
3 days ago
1
Support stresscli for codegen
#87
yao531441
closed
2 weeks ago
0
Fix microservice level benchmark issue
#86
lvliang-intel
closed
2 weeks ago
0
Audioqna stress bug fix
#85
Spycsh
closed
2 weeks ago
0
bump release version into v1.0
#84
chensuyue
closed
2 weeks ago
0
Support benchmark for docker deployment
#83
lvliang-intel
closed
2 weeks ago
0
refine rag eval docs
#82
lkk12014402
closed
2 weeks ago
2
Support stress scripts for CodeTrans
#81
letonghan
closed
2 weeks ago
1
update for audio benchmark
#80
Spycsh
closed
2 weeks ago
1
add more metrics for ragas.
#79
lkk12014402
closed
2 weeks ago
0
update for faq benchmark
#78
XinyaoWa
closed
2 weeks ago
0
Next