nlp-uoregon / trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Apache License 2.0
724 stars 99 forks source link

trankit2conllu() does not write sentence texts: # text = ... #85

Open jerekupari opened 5 months ago

jerekupari commented 5 months ago

I noticed that ConlluEditor will not open conllu data written by the trankit2conllu() function, because the sentence texts # text = Hello! . . . # text = This is Trankit. are missing. (https://github.com/nlp-uoregon/trankit/releases/tag/v1.1.0)

Also trankit2conllu() does not add the SpaceAfter=No to tokens before punctuation etc. But that only generates a red warning in ConlluEditor.