-
Site accepts html but only searches for tags. They need to be in the stopwords file or someway of recognizing these tags and ignoring them
-
import nltk from nltk.corpus import stopwords from sklearn.feature_extraction.text import TfidfVectorizer from sklearn.model_selection import train_test_split from sklearn.svm import SVC from sklearn.…
-
Hi,
It is really great that Sonic has built-in stopword lists but this time I want to apply my own stopword list.
I think it would be nice if you could override the build-in stopword lists throu…
-
### Is this a new bug?
- [X] I believe this is a new bug
- [X] I have searched the existing issues, and I could not find an existing issue for this bug
### Current Behavior
`nltk.download("punkt")`…
-
Pismo allows us to set our own stopwords. We should do so. The stopwords Pismo uses are not optimized for our purposes.
-
Add all synonyms of "said" to stopword list, and maybe all body parts (don't tell us anything useful)
-
-
Our open sourced stopwords file has a few duplicate words. Should clean it up. We should also acknowledge in the logs that we are loading stopwords from a custom file.
-
I just think it would be nice to have a functionality such that, we can provide stopwords file path in command line argument. It would be a good functionality to allow us to give our own stopwords fil…
-
### Initiative (Required)
GSSoC (Girl Script Summer of Code) 🌸
### Is your proposal related to a problem? Please describe.
The project aims to create a machine learning-based application capa…