haesleinhuepf / human-eval-bia

Benchmarking Large Language Models for Bio-Image Analysis Code Generation
MIT License
19 stars 11 forks source link

Report about system prompt in the paper #75

Open haesleinhuepf opened 2 months ago

haesleinhuepf commented 2 months ago

As indirectly suggested by @psobolewskiPhD in the image.sc thread: https://forum.image.sc/t/preprint-alert-and-call-for-contributions-llms-for-bio-image-analysis/98719/18?u=haesleinhuepf

psobolewskiPhD commented 2 months ago

Follow up work can explore the role of different system prompts! 😉

haesleinhuepf commented 2 months ago

Follow up work can explore the role of different system prompts!

That's the content of the next paper - or hopefully of 10 next papers ;-) We could use the benchmark to create some competition... a LLM-challenge =-)