Closed teetone closed 2 years ago
In addition to these tasks, please also run for gsm8k, legal_support, babi, lsat, dyck, entity_matching , human_eval , APPS , entity_matching, entity_data_imputation. Thanks
Closed since we are doing for all reasoning based on https://github.com/stanford-crfm/benchmarking/commit/ced441bb902aa2c615533aba3ccdac8ddb2c61c7
Is it for math , numeracy , synthetic_reasoning and synthetic_reasoning_natural?