thundergolfer / reasoning-about-entailment-tensorflow

:school: TensorFlow implementation of "Reasoning About Entailment with Neural Attention"
MIT License

Get the new MultiNLI Dataset incorporated #5

Open thundergolfer opened 7 years ago

thundergolfer commented 7 years ago

From the webpage:

Introduction: The Multi-Genre Natural Language Inference (MultiNLI) corpus is a crowd-sourced collection of 433k sentence pairs annotated with textual entailment information. The corpus is modeled on the SNLI corpus, but differs in that it covers a range of genres of spoken and written text, and supports a distinctive cross-genre generalization evaluation. The corpus is being used as the basis for the shared task of the RepEval 2017 Workshop at EMNLP in Copenhagen.

It shares its format with SNLI, so it should be easy to work in. They also say you can combine the two datasets during training, which could be interesting to try.
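A rough sketch of what combining the two could look like, assuming both corpora are read from their JSONL distributions, where each line carries `sentence1`, `sentence2` and `gold_label` (MultiNLI adds a `genre` field). The function names, the output dictionary keys and the file paths in the usage note are hypothetical and not taken from this repo:

```python
import json
import random

# The three classes used for training; pairs without annotator
# consensus are marked "-" in the JSONL files and are skipped here.
LABELS = ("entailment", "neutral", "contradiction")


def load_nli_jsonl(path, source):
    """Read one SNLI-style JSONL file into a list of simple dicts.

    `source` is a free-form tag (e.g. "snli" or "multinli") so the
    origin of each pair is still known after the datasets are merged.
    """
    examples = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue
            record = json.loads(line)
            label = record.get("gold_label")
            if label not in LABELS:
                continue  # no gold label, not usable for training
            examples.append({
                "premise": record["sentence1"],
                "hypothesis": record["sentence2"],
                "label": label,
                # SNLI has no genre field, so this is None for SNLI pairs.
                "genre": record.get("genre"),
                "source": source,
            })
    return examples


def combine_for_training(snli, multinli, seed=0):
    """Concatenate both example lists and shuffle them reproducibly."""
    combined = list(snli) + list(multinli)
    random.Random(seed).shuffle(combined)
    return combined
```

Usage would be along the lines of `combine_for_training(load_nli_jsonl("snli_train.jsonl", "snli"), load_nli_jsonl("multinli_train.jsonl", "multinli"))`, with the paths swapped for wherever the downloaded files actually live. Keeping the `source` and `genre` tags makes it possible to report accuracy per dataset or per genre later.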

thundergolfer commented 7 years ago

#7 starts this off.

thundergolfer commented 7 years ago

Completion of this is blocked until the test section of the MultiNLI dataset is released.