Open eu9ene opened 4 days ago
Related to #575. We discussed that it should be easy to populate the group_logs evals table row by row from different evaluation tasks.
We also discussed that it might be possible to generate this table as a report from the source evaluation data of the runs. In that case we wouldn't need to publish the same metrics twice. We should explore this option too.
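To make the report option more concrete, here is a rough sketch of how the table could be assembled from per-task results instead of being published separately. Everything in it is an assumption for illustration: the function name `build_evals_table`, the record fields (`model`, `dataset`, `metric`, `value`), and the idea that each evaluation task emits one such record. It doesn't reflect how the evaluation data is actually stored in the runs.

```python
from collections import defaultdict


def build_evals_table(records):
    """Pivot per-task evaluation records into one row per (model, dataset).

    `records` is a hypothetical list of dicts, one per evaluation task result,
    with the keys "model", "dataset", "metric" and "value".
    """
    rows = defaultdict(dict)
    metrics = []  # metric names in first-seen order, used as column headers

    for rec in records:
        key = (rec["model"], rec["dataset"])
        rows[key][rec["metric"]] = rec["value"]
        if rec["metric"] not in metrics:
            metrics.append(rec["metric"])

    columns = ["model", "dataset", *metrics]
    # Metrics that a task did not report are left as None in that row.
    data = [
        [model, dataset, *(values.get(m) for m in metrics)]
        for (model, dataset), values in sorted(rows.items())
    ]
    return columns, data
```

Since rows are keyed by model and dataset, each evaluation task can contribute its record independently and the table can be rebuilt from the source data at any time. That would cover both the row-by-row and the report approach.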