GenBench / genbench_cbt_2023

The official Genbench Collaborative Benchmarking Task repository 2023 (Archived)
Other
14 stars 18 forks source link

Extra arguments for hf dataset loading #32

Open anmoisio opened 1 year ago

anmoisio commented 1 year ago

Hi,

I would need to pass extra keyword arguments for the load_dataset function of the datasets library, like this:

from datasets import load_dataset
load_dataset("europarl_bilingual", lang1="en", lang2="fi")

But it seems this is not supported by the API. Is there a way to load this dataset using the API?

kazemnejad commented 1 year ago

Thanks for bringing up this feature. At this point, we plan to add in the final version. In the meantime, you can override Task._load_data_source() method in your Task implementation to add this.

https://github.com/GenBench/genbench_cbt/blob/caa63fbf37733c76b218db5fa08bb2e67ac07b6c/src/genbench/task.py#L466

anmoisio commented 1 year ago

Okay thanks for the help!