We are currently running the La Leaderboard evaluations in a copy of the Harness cloned from SomosNLP's fork, because that fork already included some of the tasks we needed. The clone lives in GPFS at /gpfs/projects/bsc88/evaluation/leaderboard_eval/lm-evaluation-harness. We need to implement those tasks in this version of the Harness so that we can remove the clone and use only this one.