DevQualityEval: An evaluation benchmark 📈 and framework to compare and evolve the quality of code generation of LLMs.
137
stars
5
forks
source link
Only run model/provider as well as testdata checks on the host if the runtime is not containerized #290
Closed
Munsio closed 3 months ago