issues
search
haesleinhuepf
/
human-eval-bia
Benchmarking Large Language Models for Bio-Image Analysis Code Generation
MIT License
9
stars
2
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
measure execution time of tests
#71
haesleinhuepf
opened
1 week ago
0
count number of comments in generated code
#70
haesleinhuepf
opened
1 week ago
0
Add claude 3.5 sonnet
#69
haesleinhuepf
opened
1 week ago
1
add test-case for cell tracking measuring the speed of a cell and/or number of cells over time
#68
haesleinhuepf
opened
2 weeks ago
0
Add gemini 1.5 flash benchmarking results
#67
haesleinhuepf
opened
1 month ago
0
sampled+evaluated gpt4o, reran plotting notebooks
#66
haesleinhuepf
opened
1 month ago
1
Add test for linear intensity profile
#65
marabuuu
closed
4 weeks ago
0
rename codellama
#64
haesleinhuepf
opened
2 months ago
1
Samples from recent open source models.
#63
haesleinhuepf
closed
2 months ago
0
Samples from recent open source models.
#62
jkh1
closed
2 months ago
1
Samples lost due to error when sampling
#61
haesleinhuepf
opened
2 months ago
1
Timeout when sampling
#60
haesleinhuepf
opened
2 months ago
1
Benchmark against bigger open models
#59
dcfidalgo
opened
2 months ago
6
revised main text
#58
haesleinhuepf
closed
2 months ago
0
add seaborn plots
#57
nscherf
closed
2 months ago
0
Rerun benchmark
#56
haesleinhuepf
closed
2 months ago
0
Mistral benchmarking on blablador currently fails
#55
haesleinhuepf
opened
2 months ago
0
add notebook that summarizes which libraries were used in generated code
#54
haesleinhuepf
closed
2 months ago
0
What about future models learning from our resource?
#53
tischi
opened
2 months ago
1
One model often fails with the same error
#52
haesleinhuepf
opened
2 months ago
1
add notebook to summarize common failure reasons
#51
haesleinhuepf
closed
2 months ago
0
add test to load a nifti image
#50
nscherf
closed
2 months ago
1
rename read-... test case to open-... so that it fits better to others
#49
haesleinhuepf
closed
2 months ago
0
added test-case for using aicsimageio, example data, requirements
#48
haesleinhuepf
closed
2 months ago
0
Use pytorch and/or tensorflow?
#47
haesleinhuepf
opened
2 months ago
0
Tex paper
#46
haesleinhuepf
closed
2 months ago
0
add contributing guide and code of conduct
#45
haesleinhuepf
closed
2 months ago
1
Add CONTRIBUTING guide and code of conduct
#44
haesleinhuepf
closed
2 months ago
1
add dependencies which made some tests fail
#43
haesleinhuepf
closed
2 months ago
1
add notebook for detecting missing requirements
#42
haesleinhuepf
closed
2 months ago
3
add documentation how to add requirements
#41
haesleinhuepf
closed
2 months ago
1
Add test for radial intensity profile
#40
tischi
closed
2 months ago
1
How to deal with tests that fail due to missing dependencies
#39
tischi
opened
2 months ago
6
Add fit_circle test
#38
tischi
closed
2 months ago
0
Histogram equalization of an image
#37
haesleinhuepf
opened
2 months ago
0
add test for linear intensity profile
#36
haesleinhuepf
opened
2 months ago
0
add test for circle fitting?
#35
tischi
closed
2 months ago
2
add test for radial intensity profile
#34
tischi
closed
2 months ago
1
Add read_zarr test, add zarr dependency, add zarr example data
#33
tischi
closed
2 months ago
1
count assert statements in our test-case notebooks
#32
haesleinhuepf
opened
2 months ago
1
add bland-altman test case
#31
haesleinhuepf
closed
2 months ago
0
added test-case combine-columns
#30
haesleinhuepf
closed
2 months ago
0
fix typos in test-case names
#29
haesleinhuepf
closed
2 months ago
0
add test-case binary_skeleton
#28
haesleinhuepf
closed
2 months ago
0
add test-case for tiled image processing
#27
haesleinhuepf
closed
2 months ago
0
Benchmark gemini-1.5-pro and gemini ultra
#26
haesleinhuepf
opened
2 months ago
0
Better data visualization
#25
haesleinhuepf
closed
2 months ago
0
add test case for neuroimaging: load nifti file
#24
nscherf
closed
2 months ago
3
add use case: tiled processing
#23
haesleinhuepf
closed
2 months ago
0
add use case: use aicsimageio to load a file
#22
haesleinhuepf
closed
2 months ago
0
Next