buda-base / lucene-bo

Lucene analyzer for Tibetan
Apache License 2.0

normalizing more common Sanskrit stacks #33

Open, opened by eroux 3 years ago

eroux commented 3 years ago
brunogml commented 3 years ago

Some cases found in the rnying rgyud e-texts:

eroux commented 10 months ago

we should add all the variants of lotsawa:

and probably additionally: