buda-base / lucene-bo

Lucene analyzer for Tibetan
Apache License 2.0

There might be a search issue with ནཱ་/nA #42

Open JannTibetan opened 11 months ago

JannTibetan commented 11 months ago

The search engine does not seem to recognize ནཱ་ in Tibetan Unicode, although it does recognize nA in Extended Wylie. I recorded a video to illustrate the issue: https://capture.dropbox.com/WvGXNINyxhKjKaE7
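For anyone trying to reproduce this, it may help to confirm exactly which code points the Unicode query contains. The following is a minimal sketch using only the Python standard library; the helper name `describe` is hypothetical, and it only inspects the input string. It assumes nothing about how lucene-bo tokenizes or normalizes it.

```python
import unicodedata

# The syllable from the report, written with escapes so the code points are explicit:
# base letter, subjoined long-vowel sign, and the trailing tsheg.
SYLLABLE = "\u0f53\u0f71\u0f0b"


def describe(text):
    """Return (code point, Unicode name) pairs for each character in text."""
    return [(f"U+{ord(ch):04X}", unicodedata.name(ch, "<unnamed>")) for ch in text]


if __name__ == "__main__":
    for code_point, name in describe(SYLLABLE):
        print(code_point, name)
```

Comparing this listing against the characters actually pasted into the search box can rule out input-side differences (for example, a different encoding of the long vowel) before looking at the analyzer itself.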

eroux commented 11 months ago

Moving to lucene-bo.

eroux commented 11 months ago

Well, there actually is an issue; thanks for spotting that!

I've fixed the code, but unfortunately fixing it on the website will require a full re-indexing. I think the end of the year is actually a good time for that, but doing it right now could be a bit disruptive.