bnosac / udpipe

R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
https://bnosac.github.io/udpipe/en
Mozilla Public License 2.0
209 stars 33 forks source link

Fix \n in misc #27

Open jwijffels opened 6 years ago

jwijffels commented 6 years ago

misc output in as.data.frame/udpipe_annotate always ends with \n due to use of std::istringstream I don't believe this is a big deal but at least it should be fixed in order to correctly reconstruct the original text based on SpaceAfter=No/SpacesAfter/SpacesBefore/SpacesInToken so that a from/to can be added. To be used alongside crfsuite.