Open DotaArtist opened 5 years ago
The target word suffix plus a number will cause the extraction to fail.
import flashtext _extractor = flashtext.KeywordProcessor() _extractor.add_keyword('地中海贫血') True _extractor.extract_keywords('地中海贫血') ['地中海贫血'] _extractor.extract_keywords('地中海贫血2') []
FlashText is designed to only match complete words (words with boundary characters on both sides)
https://arxiv.org/pdf/1711.00046.pdf
The target word suffix plus a number will cause the extraction to fail.