buda-base / lucene-bo

Lucene analyzer for Tibetan
Apache License 2.0
12 stars 3 forks source link

lenient mode: hyphens as tshegs #34

Closed eroux closed 2 years ago

eroux commented 3 years ago

Some users are searching the website using alalc and one of the main issues is that alalc is using hyphens as syllable breakers while they are used in -i in ewts for reversed gigus. In lenient mode we should transform -([^i]) into $1