-
## Context
It appears that the nltk data is getting re-downloaded (or at least attempted) each time a new post is created.
I'm not sure why it's trying to download the data so many times?
```
…
-
## Description
Edge Case : Since keywords are mainly made by avoiding stopwords, for some cases the keywords extracted do not interpret the meaning of the text exactly.
For example : If text is - …
-
We need a way for users to customize the stopwords list and or swap in their own for use by the various NLP processes that check a stopwords list. @ericleasemorgan I think this enahncement relies on u…
-
I liked the `texthero`, and I want to contribute in somehow.
First, I want to discuss something that boring me - stopwords..
**Problem** - I want to deploy a solution without the `spacy` stopwords…
-
Also i have very serious issue with keyphrases on KeyBERT. for example if i add "climate integrated services" to stopword list then since word have 3 syllables its considered as a phrase…
-
Thanks for making this library! With both version 0.2.0 and 0.2.1 i get out of bounds errors for some of my queries. Here's a full stack trace:
```
Traceback (most recent call last):
File "/Use…
-
I am trying to remove some common words from my Swedish corpus, apart from the Snowball-stopwords, but the textProcessor keeps missing them. I've tried both to create a character vector including the …
-
It will be great to manage **stopwords** via an API call instead of relying on putting a file down on all of the Elasticsearch nodes, as it is supported now for synonyms.
https://www.elastic.co/guide…
-
Currently, the stopwords page only shows the top 500 most frequent words (from the entire corpus) in the table. So users can only search and stop/unstop within those 500 words. We should support the a…
-
first, there are a lot of old/literary conjugations of the auxiliary verbs. it's a lot of computation for words rarely used in modern french. but the problem is really that some words are wrong. _été_…