DevQualityEval: An evaluation benchmark 📈 and framework to compare and evolve the quality of code generation of LLMs.
137
stars
5
forks
source link
Check if the repository for the transpile task is valid before running the evaluation, so it is checked just once #306
Closed
ruiAzevedo19 closed 4 months ago
Part of #263