waseem18 / node-rake

A NodeJS implementation of the Rapid Automatic Keyword Extraction algorithm.
http://www.thabraze.me/node-rake/
MIT License
100 stars 20 forks source link

Word bounaries for unicode chars #20

Open sr9yar opened 6 years ago

sr9yar commented 6 years ago

\b are not working for languages other than english. Which makes it impossible to use at all. Simply removing them seems to work fine though.

waseem18 commented 6 years ago

\b are not working for languages other than english

@sr9yar I'm not sure if node-rake works for languages other than English and even if it works I'm not sure about it's effectiveness.

I'm speculating if removing \b is going to affect the performance of keyword generation.