lucaong / minisearch

Tiny and powerful JavaScript full-text search engine for browser and Node
https://lucaong.github.io/minisearch/
MIT License
4.64k stars 133 forks source link

Fix tokenizer behavior on multiple contiguous split characters #271

Closed lucaong closed 1 month ago

lucaong commented 1 month ago

Multiple contiguous spaces or punctuation characters should be clustered together when splitting