opea-project GenAIEval issues

opea-project / GenAIEval

Evaluation, benchmark, and scorecard, targeting performance on throughput and latency, accuracy on popular evaluation harnesses, safety, and hallucination

Apache License 2.0

17 stars 28 forks

issues


Add benchmark part into top README

#127 lvliang-intel opened 12 hours ago
0
update rag_eval readme

#126 daisy-ycguo closed 18 hours ago
0
fix bigcode version when python>=3.11

#125 lkk12014402 closed 1 day ago
0
doc: fix JSON example

#124 dbkinder closed 18 hours ago
2
benchmark in v0.9 stopped supporting k3s

#123 ksandowi opened 2 days ago
0
doc: fix JSON example

#122 dbkinder closed 2 days ago
3
add acc tuning script.

#121 lkk12014402 closed 2 days ago
0
enable namespace setting in evals/benchmark tool

#120 daisy-ycguo opened 4 days ago
0
Add VisualQnA evaluation support

#119 lvliang-intel opened 5 days ago
0
doc: add policy validation/removal to the platform optimization

#118 askervin closed 2 days ago
1
add audioqna asr wer eval scripts

#117 Spycsh closed 4 days ago
0
[ChatQnA] Support the replica tuning for ChatQnA

#116 Zhenzhong1 closed 3 days ago
1
Fix ragas error

#115 lkk12014402 closed 1 week ago
0
update llm-as-judge doc.

#114 lkk12014402 closed 1 week ago
0
[v1.0] Add docker metric support

#113 bjzhjing closed 1 week ago
0
Update CODEOWNERS

#112 kevinintel closed 2 days ago
0
fix issue because of ragas changes

#111 lkk12014402 closed 1 week ago
0
Add README for codegen acc test.

#110 yao531441 closed 1 week ago
0
doc: fix headings and indents

#109 dbkinder closed 1 week ago
0
doc: add title to new FaqGen README

#108 dbkinder closed 1 week ago
1
Fix rerank benchmark script

#107 gavinlichn closed 1 week ago
0
Add code owner

#106 gavinlichn closed 1 week ago
2
Support service-list for metrics collection in benchmark.py

#105 joshuayao closed 1 week ago
0
Support benchmark file for w/o rerank pipeline

#104 letonghan closed 1 week ago
0
Update configuration in benchmark README

#103 daisy-ycguo closed 1 week ago
0
Metrics files are not dumped if enabling "service-metric-collect" when using benchmark.py

#102 joshuayao closed 1 week ago
1
No guide for using benchmark tool behind proxy

#101 joshuayao opened 1 week ago
0
benchmark tools support metrics for Docker deployment

#100 xiguiw opened 1 week ago
2
Failed to install dependencies of OPEA benchmark tool if Python version is higher than 3.10

#99 joshuayao opened 1 week ago
2
Update chatqna input to fix input length

#98 letonghan closed 1 week ago
0
Missing dependencies in evals/benchmark/stresscli/requirements.txt

#97 joshuayao opened 1 week ago
0
add code owners

#96 kevinintel closed 1 week ago
0
Support microservice level benchmark

#95 lvliang-intel closed 1 week ago
0
Support bigcode eval for codegen v0.1

#94 yao531441 closed 1 week ago
2
Support P50, P90, P99 for next token latency

#93 lvliang-intel closed 1 week ago
0
Missing initial instructions in evals/benchmark/README (and others)

#92 dbkinder opened 2 weeks ago
0
Add FaqGen Accuracy scripts & Refine Ragas

#91 XinyaoWa closed 1 week ago
2
doc: fix heading level

#90 dbkinder closed 1 week ago
0
Align llm microservice parameters with end to end test

#89 lvliang-intel closed 2 weeks ago
0
Add CRAG benchmark

#88 minmin-intel closed 3 days ago
1
Support stresscli for codegen

#87 yao531441 closed 2 weeks ago
0
Fix microservice level benchmark issue

#86 lvliang-intel closed 2 weeks ago
0
Audioqna stress bug fix

#85 Spycsh closed 2 weeks ago
0
bump release version into v1.0

#84 chensuyue closed 2 weeks ago
0
Support benchmark for docker deployment

#83 lvliang-intel closed 2 weeks ago
0
refine rag eval docs

#82 lkk12014402 closed 2 weeks ago
2
Support stress scripts for CodeTrans

#81 letonghan closed 2 weeks ago
1
update for audio benchmark

#80 Spycsh closed 2 weeks ago
1
add more metrics for ragas.

#79 lkk12014402 closed 2 weeks ago
0
update for faq benchmark

#78 XinyaoWa closed 2 weeks ago
0