haesleinhuepf / human-eval-bia

Benchmarking Large Language Models for Bio-Image Analysis Code Generation
MIT License
19 stars 11 forks source link

test cases suggestions #99

Open pr4deepr opened 2 weeks ago

pr4deepr commented 2 weeks ago

Hi @haesleinhuepf In terms of test cases:

Are these of interest?

Cheers Pradeep

pr4deepr commented 2 weeks ago

Another one could be composing affine transformations.

haesleinhuepf commented 2 weeks ago

Awesome, thanks for the input!

The others are great ideas too! Feel free to send a PR! :-)

And one hint: When implementing polygon, points and also affine-transforms: Make sure to build in some tolerance in the assert-statements so that multiple implementations (with slightly different results) can pass the tests.