How do you think will this influence the benchmark results?
I have not tested, but I suspect no LLM is currently capable to solve it.
Why do you think it makes sense to merge this PR?
Merging tables (correctly) is a common task for bio-image analysts, for example when multiple feature extraction algorithms are combined to build a large result table.
This PR contains:
sample_....jsonl
files)..._results.jsonl
files)Related github issue (if relevant): closes #0
Short description:
How do you think will this influence the benchmark results?
Why do you think it makes sense to merge this PR?