tensorflow / datasets

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
https://www.tensorflow.org/datasets
Apache License 2.0
4.29k stars 1.54k forks source link

[data request] MultiNLI #19

Closed georgedahl closed 5 years ago

georgedahl commented 5 years ago

The Multi-Genre Natural Language Inference (MultiNLI) corpus is a crowd-sourced collection of 433k sentence pairs annotated with textual entailment information. The corpus is modeled on the SNLI corpus, but differs in that covers a range of genres of spoken and written text, and supports a distinctive cross-genre generalization evaluation. The corpus served as the basis for the shared task of the RepEval 2017 Workshop at EMNLP in Copenhagen.

Folks who would also like to see this dataset in tensorflow/datasets, please +1/thumbs-up so the developers can know which requests to prioritize.

adikolsur commented 5 years ago

Hi @georgedahl , I would like to work on adding to this dataset. I want to get started with this Project. Can you please help me get started and assign this issue to me.

Thanks

rsepassi commented 5 years ago

Hi @adikolsur, thanks for your interest! Actually this dataset was added by #80, so closing this issue. Let's find you another dataset to work on. Here are the current unassigned dataset requests.