openai / finetune-transformer-lm

Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
https://s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf
MIT License
2.15k stars 503 forks

Where can I get these two datasets? #41

Closed: ghost closed this issue 5 years ago

ghost commented 5 years ago

def rocstories(data_dir, n_train=1497, n_valid=374):
    storys, comps1, comps2, ys = _rocstories(os.path.join(data_dir, 'cloze_test_val__spring2016 - cloze_test_ALL_val.csv'))
    teX1, teX2, teX3, _ = _rocstories(os.path.join(data_dir, 'cloze_test_test__spring2016 - cloze_test_ALL_test.csv'))

rodgzilla commented 5 years ago

You can get the datasets from this website.

ghost commented 5 years ago

I tried clicking 'this form', but the link provided (https://goo.gl/forms/aQz39sdDrO) is unreachable for some reason.

rodgzilla commented 5 years ago

You will receive links to the dataset by email.

ghost commented 5 years ago

The link works after switching to another browser. Thanks!
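
Once the download links arrive, it may help to confirm that both CSV files are where rocstories() looks for them before running the training script. The following is only a minimal sketch: the two file names are copied from the snippet in the question, while missing_rocstories_files and the 'data/' directory are hypothetical names used for illustration and are not part of the repository.

```python
import os

# File names copied verbatim from the rocstories() snippet in the question.
ROC_VAL_FILE = 'cloze_test_val__spring2016 - cloze_test_ALL_val.csv'
ROC_TEST_FILE = 'cloze_test_test__spring2016 - cloze_test_ALL_test.csv'


def missing_rocstories_files(data_dir):
    """Return the expected CSV file names that are not present in data_dir.

    Hypothetical helper for illustration; it is not part of the repository.
    """
    expected = [ROC_VAL_FILE, ROC_TEST_FILE]
    return [name for name in expected
            if not os.path.isfile(os.path.join(data_dir, name))]


if __name__ == '__main__':
    # 'data/' is an assumed location; pass whatever directory you use as data_dir.
    missing = missing_rocstories_files('data/')
    if missing:
        print('Missing files:', missing)
    else:
        print('Both ROCStories CSV files are in place.')
```

An empty list means both files were found. Note that the names contain spaces, so a file renamed by the browser during download will show up as missing.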