haesleinhuepf / human-eval-bia

Benchmarking Large Language Models for Bio-Image Analysis Code Generation
MIT License
13 stars 4 forks source link

count assert statements in our test-case notebooks #32

Open haesleinhuepf opened 2 months ago

haesleinhuepf commented 2 months ago

This PR contains:

Related github issue (if relevant): closes #0

Short description:

How do you think will this influence the benchmark results?

Why do you think it makes sense to merge this PR?

haesleinhuepf commented 2 months ago

I'm not 100% sure if this is the way to go. Don't merge this for now.