DevQualityEval: An evaluation benchmark 📈 and framework to compare and evolve the quality of code generation of LLMs.
128
stars
5
forks
source link
If results folder already exists, add suffix but don't overwrite or error #176
Closed
bauersimon closed 3 months ago