Kipok / NeMo-Skills

A pipeline to improve skills of large language models
https://kipok.github.io/NeMo-Skills/
Apache License 2.0
185 stars 41 forks source link

Change metrics logic to always report all available aggregations #184

Open Kipok opened 3 weeks ago

Kipok commented 3 weeks ago

Instead of accepting aggregation type, we should always return all available aggregations. It's probably also a good idea to automate "pass@k" and majority metric (might need some opt-in and light customization from each metric class to support it properly) and only ask to implement a "first" aggregation to make things simpler and reuse more code