the extension of Din et al. (2015) of Artetxe et al. (2017, 2018a)
Make sure to know which datasets we have in the repo and if we need to gather some at other places.
This need to be documented in our paper (the fuzziness of which dataset is used in the paper vs code)
There is 2-3 ish different datasets: