-
In the CI pipeline for PRs, the Vercel deployment is marked green :heavy_check_mark:, but clicking "Details" leads to a 404 error on the website.
![Screenshot from 2020-07-20 22-19-46](http…
-
Hi,
I'm curious whether it is possible to upload the plot directly to my Plotly dashboard after calling `hero.scatterplot()`.
chart_studio.tools.set_credentials_file(username='username', api…
-
In [setup.cfg](https://github.com/jbesomi/texthero/blob/master/setup.cfg), should `parameterized>=0.7.4` be in **development** dependencies or **install_requires**?
As per the [documentation](https…
-
Under the hood, the `tfidf` function makes use of the [sklearn.feature_extraction.text.TfidfVectorizer](https://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.TfidfVectoriz…
-
It would be cool to have parameters in the `hero.clean()` function that activate individual pre-processing steps, instead of having to construct a custom pipeline.
Proposal by @mxhofer
Notes: the solution s…
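
To make the proposal concrete, here is a minimal sketch of what flag-driven cleaning could look like. It works on a plain string rather than a pandas Series, and the function name, flag names, and the particular steps are assumptions made for illustration, not texthero's actual API.

```python
import re
import string

# Translation table that maps every ASCII punctuation character to a space.
_PUNCT_TO_SPACE = str.maketrans(string.punctuation, " " * len(string.punctuation))


def clean(text, lowercase=True, remove_digits=True,
          remove_punctuation=True, remove_whitespace=True):
    """Hypothetical flag-driven cleaner: each boolean toggles one step."""
    if lowercase:
        text = text.lower()
    if remove_digits:
        # Replace runs of digits with a space so neighbouring words stay apart.
        text = re.sub(r"\d+", " ", text)
    if remove_punctuation:
        text = text.translate(_PUNCT_TO_SPACE)
    if remove_whitespace:
        # Collapse repeated whitespace and trim both ends.
        text = " ".join(text.split())
    return text


print(clean("Hello, World 42!"))                       # hello world
print(clean("Hello, World 42!", remove_digits=False))  # hello world 42
```

With this shape, switching a single step off is a keyword argument instead of a hand-built pipeline. The trade-off is that the order of the steps is fixed inside the function.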
-
I think it would be nice to have a `fix_encoding` function in preprocessing to fix bad encoding in input text. We can build this using [ftfy](https://github.com/LuminosoInsight/python-ftfy).
Exampl…
-
**Problem:**
Given the sentence
> "E-I-E-I-O\nAnd on"
And given the pipeline `pre.lowercase`, `pre.remove_stopwords`, the `clean` method returns
> "e--e--\n "
It should return:
> "e-i-e-i-o\…
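
One plausible cause of this behaviour, sketched below, is a stop-word regex that relies on `\b`, which treats the hyphen as a word boundary. This is an assumption about the mechanism, not a reading of texthero's source. The tiny stop-word set and both function names are hypothetical. The stricter variant refuses to match a stop word that touches a hyphen or another word character.

```python
import re

# Tiny illustrative stop-word set (an assumption; a real list is much larger).
STOPWORDS = {"i", "o", "and", "on"}
_ALTERNATION = "|".join(sorted(map(re.escape, STOPWORDS), key=len, reverse=True))

# Naive pattern: \b sees "-" as a boundary, so "i" inside "e-i-e" matches.
NAIVE = re.compile(rf"\b(?:{_ALTERNATION})\b")

# Stricter pattern: the match must not be adjacent to a word character or "-".
STRICT = re.compile(rf"(?<![\w-])(?:{_ALTERNATION})(?![\w-])")


def remove_stopwords_naive(text):
    return NAIVE.sub("", text)


def remove_stopwords_strict(text):
    return STRICT.sub("", text)


text = "E-I-E-I-O\nAnd on".lower()
print(repr(remove_stopwords_naive(text)))   # 'e--e--\n '
print(repr(remove_stopwords_strict(text)))  # 'e-i-e-i-o\n '
```

The naive variant reproduces the reported output, while the stricter one leaves the hyphenated token intact. How the leftover whitespace should be handled is a separate decision that this sketch does not address.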
-
A feature for topic detection with unsupervised learning (KMeans, for example) would be really welcome!
-
Following the discussion in #78, we should add something like:
"To reduce blur in the images, width and height should be the same size, i.e., the image should be square."
-
Allow passing a custom stop-words list to the `remove_stop_words` function.
**Edit:** Starting from v1.0.6, the function is called `remove_stopwords` (without the second underscore).
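
As a rough illustration of the requested interface, the sketch below adds an optional `stopwords` argument that falls back to a default list when omitted. It operates on a plain string with whitespace tokenization. The parameter name, the default set, and the tokenization are all assumptions, not the library's implementation.

```python
# Small placeholder default (an assumption; a real default list is much larger).
DEFAULT_STOPWORDS = frozenset({"a", "an", "the", "and", "on"})


def remove_stopwords(text, stopwords=None):
    """Hypothetical signature: drop tokens found in `stopwords`.

    When `stopwords` is None the default set is used; otherwise only the
    caller's words are removed. Matching is case-insensitive.
    """
    if stopwords is None:
        words = DEFAULT_STOPWORDS
    else:
        words = frozenset(w.lower() for w in stopwords)
    # Whitespace tokenization: punctuation attached to a word is not stripped.
    kept = [tok for tok in text.split() if tok.lower() not in words]
    return " ".join(kept)


print(remove_stopwords("the cat sat on a mat"))                    # cat sat mat
print(remove_stopwords("the cat sat on a mat", ["cat", "mat"]))    # the sat on a
```

Here the custom list replaces the default rather than extending it. A caller who wants both can pass the union of the two sets.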