Benchmark: display SD in living benchmark

biocypher / biochatter

Backend library for conversational AI in biomedicine

http://biochatter.org/

MIT License


Benchmark: display SD in living benchmark #163

Open slobentanzer opened 4 weeks ago

slobentanzer commented 4 weeks ago

As per #147, we record single scores and can calculate the per-run and between-run variance. However, the full benchmark has not yet been run for enough iterations, so these values are not yet informative. Once we have run enough iterations, we can display the variance (as standard deviation, SD) in the living benchmark as well.
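A minimal sketch of how the per-run and between-run SD could be computed and shown only once enough iterations exist. This is not biochatter code: the `runs` layout (run ID mapped to a list of single scores), the function names `summarise_scores` and `format_score`, and the `min_n=3` threshold are all assumptions made for illustration.

```python
from statistics import mean, stdev


def summarise_scores(runs):
    """Return (per-run SD, between-run SD) for a hypothetical score layout.

    `runs` maps a run ID to the list of single scores recorded in that run.
    SD is None wherever fewer than two values are available.
    """
    per_run_sd = {
        run_id: stdev(scores) if len(scores) > 1 else None
        for run_id, scores in runs.items()
    }
    # Between-run SD: sample SD of the per-run mean scores.
    run_means = [mean(scores) for scores in runs.values()]
    between_run_sd = stdev(run_means) if len(run_means) > 1 else None
    return per_run_sd, between_run_sd


def format_score(mean_score, sd, n, min_n=3):
    """Format a score for display, hiding the SD until n reaches min_n.

    min_n is an assumed threshold, not a value taken from the project.
    """
    if sd is None or n < min_n:
        return f"{mean_score:.2f} (n={n})"
    return f"{mean_score:.2f} ± {sd:.2f} (n={n})"


if __name__ == "__main__":
    example_runs = {"run_1": [1.0, 1.0], "run_2": [0.0, 1.0]}
    per_run, between = summarise_scores(example_runs)
    print(per_run, between)
    print(format_score(0.75, between, n=len(example_runs)))
```

Hiding the SD below a minimum number of iterations keeps the displayed benchmark from suggesting precision that the data does not yet support.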