Zyphra / Megatron-LM

Ongoing research training transformer models at scale

[ENHANCEMENT] Replace periodic validation with eval harness calls #17

Open Quentin-Anthony opened 12 months ago

Quentin-Anthony commented 12 months ago

Once a standalone lm-eval-harness script exists (see https://github.com/Zyphra/Megatron-LM/issues/11), we will replace the periodic validation-loss check with a few important lm-eval-harness tasks such as lambada.

Seeing how those task scores change over the course of training would be much more useful than tracking loss on a random validation set, and running the standalone script by hand for every checkpoint would be tedious.
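
A rough sketch of what the hook could look like, not a proposal for the final design. Everything named here is hypothetical: `PeriodicHarnessEval`, `maybe_run`, and the `eval_fn` callable are placeholders, and `eval_fn` stands in for whatever entry point the standalone script from issue 11 ends up exposing. The sketch assumes that entry point can take a list of task names and return one score per task.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical signature: takes task names, returns {task_name: score}.
# In practice this would wrap the standalone lm-eval-harness script.
EvalFn = Callable[[List[str]], Dict[str, float]]


class PeriodicHarnessEval:
    """Runs a fixed set of eval tasks every `interval` training iterations."""

    def __init__(self, eval_fn: EvalFn, tasks: List[str], interval: int) -> None:
        if interval <= 0:
            raise ValueError("interval must be a positive integer")
        self.eval_fn = eval_fn
        self.tasks = list(tasks)
        self.interval = interval
        # (iteration, metrics) pairs, so scores can be plotted over time.
        self.history: List[Tuple[int, Dict[str, float]]] = []

    def maybe_run(self, iteration: int) -> Optional[Dict[str, float]]:
        """Evaluate if `iteration` falls on the interval, otherwise do nothing."""
        if iteration == 0 or iteration % self.interval != 0:
            return None
        metrics = self.eval_fn(self.tasks)
        self.history.append((iteration, metrics))
        return metrics


if __name__ == "__main__":
    # Stand-in evaluator with made-up scores, only to show the call pattern.
    def placeholder_eval(tasks: List[str]) -> Dict[str, float]:
        return {task: 0.0 for task in tasks}

    hook = PeriodicHarnessEval(placeholder_eval, tasks=["lambada"], interval=1000)
    for iteration in range(1, 3001):
        # ... one training step would go here ...
        metrics = hook.maybe_run(iteration)
        if metrics is not None:
            print(iteration, metrics)
```

Passing the evaluator in as a callable keeps the training loop independent of how the harness is actually invoked (in-process or as a subprocess against a saved checkpoint), which is still undecided here.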