Closed xoofx closed 4 years ago
I saw that it's the same in lunr.js (as you did a close port of it)
Using just space or hyphen seems so wrong to separate words, but maybe they expect a string to be squashed to these only? (so any other characters would have to be replaced by spaces before indexing?)
Can you write down a few cases of the string to index, what it gets tokenized as, and what you'd expect?
Closing as lunr is expecting only words and spaces/hyphen, I can workaround it on my side.
It seems that characters
/
(e.gdoc/advanc
) or(
(e.gbin(out
) are indexed with surrounding word characters.