tibetan-nlp / old-tibetan-corpus

Linguistically analyzed Old TIbetan documents and some tools for processing Old Tibetan text
MIT License
5 stars 1 forks source link

Replace [] with [---] or something more informative in OT Chronicle CONLLU #12

Closed heacu closed 3 years ago

heacu commented 3 years ago

Particularly on pages 27 and 28 of the OT Chronicle, what was converted to Unicode as [---] is showing up in BRAT and our CONLLU as []. These are important indicators, since they convey that the (partial) words on either side are not directly adjacent to each other; the source text has a gap.

I prefer [---] to [], but am also open to alternatives @FChrispz

FChrispz commented 3 years ago

@heacu I replaced [] with [---]