-
I'm currently processing a list of 100k+ texts. Regex is incredibly slow for this so I thought FlashText would be perfect. I'm unable to use FlashText to replace the '_' with ' '
`>>> text = 'the_q…
-
Hi,
Tried FlashText. Unfortunately, it picks up the wrong standardised name.
For example, if I have two different **values** (**standardised name** ) for **unclean name** i.e **"able" and "unable"…
-
Maybe I missed something, but I do not understand what the advantage of using the tie dictionary is over using a hashing dictionary is. Can you tell me what I am missing?
Let K denote the number of…
-
```python
len("İ") # 1
```
```python
len("İ".lower()) # 2
```
this will cause string index out of range flashtext.
-
-
Hi,
I´m loading 4,6 million keywords + their replacements into flashtext.
The raw data in a pandas dataframe consumes approx. 1 GB RAM, profiled with pd.DataFrame.memory_usage(True, True) and gu…
-
The current version in PyPi is `flashtext 2.7`, which was Released: Feb 16, 2018
There has been quite a few improvements and fixes since then, are you planning on releasing a new version soon?
-
I do not know if this is an implementation issue or a feature.
Full sub-patterns are not matched if the text contains the pattern.
### Test code
```go
package main
import (
"fmt"…
-
The first cell could be replaced by -
```
!pip install gensim
!pip install git+https://github.com/boudinfl/pke.git
!python -m spacy download en
!pip install bert-extractive-summarizer --upgrade…
-
in python 2.7:
`from flashtext import KeywordProcessor`
`keyword_processor = KeywordProcessor()`
`keyword_processor.add_keyword(u'北京')`
`keyword_processor.add_keyword(u'欢迎')`
`keyword_processor…