This PR adds a WDWDatasetReader and supporting classes to the pipeline, as well as unit tests.
I never used Scala before Tuesday, so let me know if any code i've written isn't idiomatic or there's a better way to do it.
The WhoDidWhat Dataset is a cloze dataset where the task is to predict an entity in a sentence (the "question", which is the concatenation of the "left context" and "right context" which refer to the text on either side of the blank to fill in) given an article / passage on the same subject as the sentence and several answer choices.
This PR adds a
WDWDatasetReader
and supporting classes to the pipeline, as well as unit tests.I never used Scala before Tuesday, so let me know if any code i've written isn't idiomatic or there's a better way to do it.
The WhoDidWhat Dataset is a cloze dataset where the task is to predict an entity in a sentence (the "question", which is the concatenation of the "left context" and "right context" which refer to the text on either side of the blank to fill in) given an article / passage on the same subject as the sentence and several answer choices.