bjerva / cwi18

Repository for https://www.aclweb.org/anthology/W18-0518/
Apache License 2.0
0 stars 0 forks source link

Basic features #6

Closed jbingel closed 6 years ago

jbingel commented 6 years ago

Implement basic features:

jbingel commented 6 years ago

Data is now loaded from data_augmented, please put POS annotation in there :) By the way, looking at the (German) data I also realised it's terribly badly sentence tokenised (just split on . it seems, which especially for the German date format DD. Month YYYY super often breaks sentences.)

jbingel commented 6 years ago

Hey, I could see you pushed the augmented data, thanks! Are you still working on this? If I see things right, I think it would be better to include the POS data in a different way (integrating pos count columns in the *.tsv files). I can do that if you're not currently working on it?

bjerva commented 6 years ago

Sure, feel free!