Use a task metric map in lm_eval runner

microsoft / TransformerCompression

For releasing code related to compression methods for transformers, accompanying our publications

MIT License

Use a task metric map in lm_eval runner #146

Closed. pashminacameron closed this 3 months ago

pashminacameron commented 3 months ago

Log the raw results before doing any averaging, so that the full results are always recorded even if a later step fails. Use a task metric map so that the averaging code does not have to hard-code specific metric names. The map currently supports only the tasks in the paper and MMLU. New tasks, each with its metric of interest, can be added where necessary.
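A minimal sketch of the idea, not the code in this PR: `TASK_METRIC_MAP`, `average_metric`, the task list, and the metric keys such as `"acc_norm,none"` are assumptions chosen for illustration, and the numbers in the example are made up. It logs the raw results first, then looks up each task's metric through the map instead of naming metrics in the averaging code.

```python
import json
import logging

# Hypothetical map from task name to the metric key of interest.
# The task names and metric keys here are illustrative assumptions.
TASK_METRIC_MAP = {
    "arc_challenge": "acc_norm,none",
    "arc_easy": "acc_norm,none",
    "hellaswag": "acc_norm,none",
    "piqa": "acc_norm,none",
    "winogrande": "acc,none",
    "mmlu": "acc,none",
}


def average_metric(results, metric_map=TASK_METRIC_MAP):
    """Log raw per-task results, then average the mapped metric over tasks.

    `results` is assumed to be a dict of {task_name: {metric_key: value}}.
    """
    # Log everything first, so nothing is lost if the averaging fails.
    logging.info("Raw results: %s", json.dumps(results, sort_keys=True))

    if not results:
        raise ValueError("No results to average.")

    scores = []
    for task, task_results in results.items():
        if task not in metric_map:
            raise KeyError(f"No metric registered for task '{task}'.")
        scores.append(task_results[metric_map[task]])
    return sum(scores) / len(scores)


# Made-up numbers, for illustration only.
example_results = {
    "piqa": {"acc,none": 0.5, "acc_norm,none": 0.75},
    "winogrande": {"acc,none": 0.25},
}
example_average = average_metric(example_results)
```

Supporting a new task in this sketch means adding one entry to the map; the averaging function itself does not change, and an unmapped task fails loudly after the raw results have already been logged.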