dreasysnail / POINTER

MIT License
112 stars 19 forks source link

About News Dataset #8

Open JiamingUWU opened 3 years ago

JiamingUWU commented 3 years ago

In your paper, The EMNLP2017 WMT News dataset5 contains 268,586 sentences, but there are lots of datasets in url http://www.statmt.org/wmt17/ and I have no sense which one is the dataset used in experiments. I'd be appreciated if you provide some details.

guoyinwang commented 3 years ago

We use the news data obtained from https://github.com/pclucas14/GansFallingShort/tree/master/real_data_experiments/data/news

JiamingUWU commented 3 years ago

sorry to disturb you. How is the data set divided into training set , dev set and test set?