Closed awaelchli closed 3 weeks ago
Adds a new column to the config_hub/finetune/README.md with automated benchmarks as a follow up to #1337. The new column "Multitask score" covers MMLU at the moment. More categories will be added in the future.
config_hub/finetune/README.md
The following settings were used to run MMLU:
litgpt evaluate --checkpoint_dir ... --batch_size 4 --device cuda --dtype bfloat16 --tasks mmlu ...
Also removes the "Dataset" and "Precision" columns as they are constant, to make space.
Adds a new column to the
config_hub/finetune/README.md
with automated benchmarks as a follow up to #1337. The new column "Multitask score" covers MMLU at the moment. More categories will be added in the future.The following settings were used to run MMLU:
Also removes the "Dataset" and "Precision" columns as they are constant, to make space.