Closed austinmw closed 5 months ago
Hi @austinmw, thanks for always reporting these bugs - unexpectedly the tasks_langs=["en"] does in fact not run filter datasets to only run on English tasks. Instead what it does is that it filters to all the "en" subsets of MTEB.
I agree that this is counter-intuitive and I am slowly working toward fixing this in the following utility function:
tasks = mteb.get_tasks(languages=["eng"])
evaluation = MTEB(tasks = tasks)
This will get you all tasks that contain English (but also other languages in case of multilingual datasets).
I address this (along with the bug you discovered in #604 so that once it is merged the code above should be what you are looking for.
Sounds good, thanks again!
@austinmw just letting you know that this is now the default mode of interaction with MTEB. The previous approach will raise a deprecation warning
Thanks!
Hi, I tried to run the following:
And got the following error:
Is this error expected for the configuration preferences I set? If so, would you mind please telling me how to essentially say "run all tasks for english"? Thanks!