openai / evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Other
15.01k stars 2.61k forks source link

Music evals #138

Open bhack opened 1 year ago

bhack commented 1 year ago

It could be interesting to explore if we could use MusPy to add some text/symbolic music evals.

/cc @salu133445

bhack commented 1 year ago

See also https://github.com/dvruette/figaro