Closed aliaamiri closed 2 years ago
To start off with, it is not unexpected that there may be some overlap between a stopword list and a sentiment lexicon; these kinds of word lists are built in different ways and for different purposes. So that is correct, yep.
Specifically for the Onix Text Retrieval Toolkit stopword list, I am sad to see that these folks (who were Lextek) seem to not be around anymore and they have taken their documentation with them. 😕 I'll see what I can find with more research.
Thank you so much for the clarification🙌.
I found Onix in this comprehensive list of stopwords lexicons. As you mentioned previously, these lists are very different in nature. Maybe, I should stick with a custom_made stopwords list.
Let me know if you have further questions!
This issue has been automatically locked. If you believe you have found a related problem, please file a new issue (with a reprex: https://reprex.tidyverse.org) and link to this issue.
Some
stop_words
do not belong to the list. For example, there are somestop_words
that are present in sentiment lexicons:onix
lexicon "good" is a stop_word. I tried to check this, but the link provided in the help environment of RStudio does not have any content. So, I checked it in sentiment-lexicons: affin(value = 3), bing(sentiment = positive), loughran(sentiment = positve), nrc(sentiment = joy etc.)SMART
lexicon. But it is included in some sentiment-lexicons: afinn(value = 2), bing(sentiment = positive)And there are other examples too. Is there any source to check
onix
lexicon similar to this one forSMART
?