-
The aim of this issue is to discuss and understand when `tokenize` should happen in the pipeline.
The current solution is to apply `tokenize` once the text has already been cleaned, either with `c…
-
(Edit)
Add a function `hero.infer_lang(s)` (a suggestion for a better function name is more than welcomed!) that given a Pandas Series finds for each row the respective language.
**Implementatio…
-
## Why
推薦・機械学習勉強会は、推薦や機械学習、その周辺技術を通じてサービスを改善することにモチベーションのある人達の集まりです。ニュースやブログから論文まで、気になったものについてお互い共有しましょう!
発信のため、ここは **public** にしてあります。外部からの参加をご希望の方は松村(https://twitter.com/yu__ya4) まで DM を送るか、Wa…
-
**Submitting author:** @PetrKorab (Petr Koráb)
**Repository:** https://github.com/PetrKorab/Arabica
**Branch with paper.md** (empty if default branch): main
**Version:** 1.7.1
**Editor:** @oliviaguest…
-
Hi,
I have a bug with nstallation of texthero and gensim on google colab.
Collecting gensim==3.8.3
Using cached gensim-3.8.3.tar.gz (23.4 MB)
Preparing metadata (setup.py) ... done
Requirem…
-
If you haven’t already, check out our [contributing guidelines](https://github.com/Expensify/ReactNativeChat/blob/main/contributingGuides/CONTRIBUTING.md) for onboarding and email contributors@expensi…
-
When I am trying to import texthero it throws and error saying :
ModuleNotFoundError: No module named 'gensim.sklearn_api'
I tried installing gensim and upgraded it but nothing seems to work. Any …
-
-
I have text in a dataframe and was adding it in like this:
```python
for text in df['text']:
mdl.add_doc(text.strip().split())
```
This works fine
However, when I tried to remove stopw…
-
just want to know if I should spend time learning this package, seems like it isn't being maintained anymore