Closed jbingel closed 6 years ago
Data is now loaded from data_augmented
, please put POS annotation in there :)
By the way, looking at the (German) data I also realised it's terribly badly sentence tokenised (just split on .
it seems, which especially for the German date format DD. Month YYYY
super often breaks sentences.)
Hey, I could see you pushed the augmented data, thanks! Are you still working on this? If I see things right, I think it would be better to include the POS data in a different way (integrating pos count columns in the *.tsv
files). I can do that if you're not currently working on it?
Sure, feel free!
Implement basic features: