Closed nolanlawson closed 10 years ago
Thanks!
No prob!
I think the solution has a problem when indexing text like "A - B", it leads to an empty token which results in a broken (not searchable) index.
Yup, you're right, that's a bug. Will fix.
version 0.5.5 includes the fix for this issue.
Example:
"york-san" isn't a word, so it shouldn't be output by the tokenizer.
For precedence, Lucene's standard tokenizer also splits on hyphens, although it doesn't do it for product numbers like 31-5-6, which is not implemented here due to complexity.
Prompted by https://github.com/nolanlawson/pouchdb-quick-search/issues/3.