tibetan-nlp / old-tibetan-corpus

Linguistically analyzed Old TIbetan documents and some tools for processing Old Tibetan text
MIT License
5 stars 1 forks source link

Bulk assign lemmas to ADP, SCONJ and PART #13

Open heacu opened 3 years ago

heacu commented 3 years ago

@FChrispz mentioned that in OT Chronicle, at least, possibly also OT Annals, many ADP, SCONJ and PART are not lemmatized, where the identity of the lemma can be determined unambiguously from the POS tag and Case feature. In such cases, we should add the lemmas into the CONLLU files.