olivernn / lunr.js

A bit like Solr, but much smaller and not as bright
http://lunrjs.com
MIT License
8.96k stars 548 forks source link

Behavior of Special Characters. #112

Closed justinpathrose closed 9 years ago

justinpathrose commented 10 years ago

Hi ,

Not sure if this is the right place to discuss this. But desperately need a solution to a problem I am facing. I am using lunr.js in one of my projects. I am not able to properly handle special characters in my search. For eg, when searching in my documents with strings containing special characters like "who's" the search function gives an output even though document does not contain the word "who's". Are special characters added to the stop words. How can I avoid this. Thanks in advance.

olivernn commented 9 years ago

Sorry for the very late response to this issue.

Lunr should handle characters like apostrophes well, the pipeline will transform the token into what ends up in the index, a quick check shows that by default who's will be transformed to who'. This seems slightly strange and could possibly be handled better.

Could you put together a reduced test case that shows this behaviour that you think is incorrect and I can look into it further.

olivernn commented 9 years ago

Closing this as I need more information to be able to offer a bug fix. Please re-open if you can provide more information to help me reproduce.