Open mam10eks opened 1 year ago
We found the following case that is not handled well: a system retrieves results for only 1 topic while there are 50 topics, and the evaluation script still says that the run is valid.
We found the following case that is not handled well: a system retrieves results for only 1 topic while there are 50 topics, and the evaluation script still says that the run is valid.