haesleinhuepf human-eval-bia issues

haesleinhuepf / human-eval-bia

Benchmarking Large Language Models for Bio-Image Analysis Code Generation

MIT License

9 stars 2 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

measure execution time of tests

#71 haesleinhuepf opened 1 week ago
0
count number of comments in generated code

#70 haesleinhuepf opened 1 week ago
0
Add claude 3.5 sonnet

#69 haesleinhuepf opened 1 week ago
1
add test-case for cell tracking measuring the speed of a cell and/or number of cells over time

#68 haesleinhuepf opened 2 weeks ago
0
Add gemini 1.5 flash benchmarking results

#67 haesleinhuepf opened 1 month ago
0
sampled+evaluated gpt4o, reran plotting notebooks

#66 haesleinhuepf opened 1 month ago
1
Add test for linear intensity profile

#65 marabuuu closed 4 weeks ago
0
rename codellama

#64 haesleinhuepf opened 2 months ago
1
Samples from recent open source models.

#63 haesleinhuepf closed 2 months ago
0
Samples from recent open source models.

#62 jkh1 closed 2 months ago
1
Samples lost due to error when sampling

#61 haesleinhuepf opened 2 months ago
1
Timeout when sampling

#60 haesleinhuepf opened 2 months ago
1
Benchmark against bigger open models

#59 dcfidalgo opened 2 months ago
6
revised main text

#58 haesleinhuepf closed 2 months ago
0
add seaborn plots

#57 nscherf closed 2 months ago
0
Rerun benchmark

#56 haesleinhuepf closed 2 months ago
0
Mistral benchmarking on blablador currently fails

#55 haesleinhuepf opened 2 months ago
0
add notebook that summarizes which libraries were used in generated code

#54 haesleinhuepf closed 2 months ago
0
What about future models learning from our resource?

#53 tischi opened 2 months ago
1
One model often fails with the same error

#52 haesleinhuepf opened 2 months ago
1
add notebook to summarize common failure reasons

#51 haesleinhuepf closed 2 months ago
0
add test to load a nifti image

#50 nscherf closed 2 months ago
1
rename read-... test case to open-... so that it fits better to others

#49 haesleinhuepf closed 2 months ago
0
added test-case for using aicsimageio, example data, requirements

#48 haesleinhuepf closed 2 months ago
0
Use pytorch and/or tensorflow?

#47 haesleinhuepf opened 2 months ago
0
Tex paper

#46 haesleinhuepf closed 2 months ago
0
add contributing guide and code of conduct

#45 haesleinhuepf closed 2 months ago
1
Add CONTRIBUTING guide and code of conduct

#44 haesleinhuepf closed 2 months ago
1
add dependencies which made some tests fail

#43 haesleinhuepf closed 2 months ago
1
add notebook for detecting missing requirements

#42 haesleinhuepf closed 2 months ago
3
add documentation how to add requirements

#41 haesleinhuepf closed 2 months ago
1
Add test for radial intensity profile

#40 tischi closed 2 months ago
1
How to deal with tests that fail due to missing dependencies

#39 tischi opened 2 months ago
6
Add fit_circle test

#38 tischi closed 2 months ago
0
Histogram equalization of an image

#37 haesleinhuepf opened 2 months ago
0
add test for linear intensity profile

#36 haesleinhuepf opened 2 months ago
0
add test for circle fitting?

#35 tischi closed 2 months ago
2
add test for radial intensity profile

#34 tischi closed 2 months ago
1
Add read_zarr test, add zarr dependency, add zarr example data

#33 tischi closed 2 months ago
1
count assert statements in our test-case notebooks

#32 haesleinhuepf opened 2 months ago
1
add bland-altman test case

#31 haesleinhuepf closed 2 months ago
0
added test-case combine-columns

#30 haesleinhuepf closed 2 months ago
0
fix typos in test-case names

#29 haesleinhuepf closed 2 months ago
0
add test-case binary_skeleton

#28 haesleinhuepf closed 2 months ago
0
add test-case for tiled image processing

#27 haesleinhuepf closed 2 months ago
0
Benchmark gemini-1.5-pro and gemini ultra

#26 haesleinhuepf opened 2 months ago
0
Better data visualization

#25 haesleinhuepf closed 2 months ago
0
add test case for neuroimaging: load nifti file

#24 nscherf closed 2 months ago
3
add use case: tiled processing

#23 haesleinhuepf closed 2 months ago
0
add use case: use aicsimageio to load a file

#22 haesleinhuepf closed 2 months ago
0