dimsum16 / dimsum-data

Data for the DiMSUM shared task at SEMEVAL 2016
http://dimsum16.github.io/

Extra blank lines in dimsum16.train #4

Closed by nschneid 8 years ago

nschneid commented 9 years ago

These occur between sentences lowlands-73 and lowlands-76 (are 74 and 75 supposed to be there?), as well as before the start of the Reviews data. I removed them manually in 07db7a2 but just wanted to check that these aren't due to a preprocessing bug.

andersjo commented 9 years ago

They are due to an extra line break in the original data file. Removing them now. Note that the sentences after lowlands-73 will be renumbered, since the gap will disappear.

nschneid commented 9 years ago

Thanks. Still a bunch of line breaks between ritter-787 and the Reviews data.
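
A quick way to check for leftover blank lines like these is sketched below. It assumes the file separates sentences with exactly one blank line, so any blank line at the start of the file or directly after another blank line is reported. The function name `find_extra_blank_lines` is hypothetical and not part of the repository.

```python
def find_extra_blank_lines(lines):
    """Return 1-based line numbers of blank lines beyond a single separator.

    Assumption: sentences are separated by exactly one blank line, so a
    blank line is "extra" if it starts the file or follows another blank.
    """
    extra = []
    prev_blank = True  # treat the start of the file like a preceding blank line
    for lineno, line in enumerate(lines, start=1):
        is_blank = line.strip() == ""
        if is_blank and prev_blank:
            extra.append(lineno)
        prev_blank = is_blank
    return extra


# Possible usage on the training file (path is an assumption):
# with open("dimsum16.train", encoding="utf-8") as f:
#     print(find_extra_blank_lines(f))
```

An empty result would suggest that no stray blank lines remain; a single trailing blank line after the last sentence is not reported.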