vi3k6i5 / flashtext

Extract Keywords from sentence or Replace keywords in sentences.
MIT License
5.57k stars 598 forks source link

Some Problem When "For detecting Word Boundary currently any character other than this \w [A-Za-z0-9_] is considered a word boundary." #113

Closed SCismycat closed 4 years ago

SCismycat commented 4 years ago

Problem

Hi,I am china develop.Thanks to open source. I have a keyword in a sentence, like "人民日报1报道了相关事件", the "人民日报" is a keyword in my case,but the number "1" has been detection not as the word Boundary , so the result not contains "人民日报" keyword;

Solve

I can use regex preprocess mask the number bofore my sentence as input,so this is a bug or not? would you please fix this? or this tools not support? thanks