lfmatosm / embedded-topic-model

A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM
MIT License
85 stars 8 forks source link

[BUG] vectorizer.stop_words_ is not a property of the vectorizer #30

Open MaazBinMusa opened 1 month ago

MaazBinMusa commented 1 month ago

Describe the bug While preprocessing the documents using the utils, I encounter a bug which says stopwords is not an attribute of CountVectorizer

To Reproduce Steps to reproduce the behavior:

  1. pip install the module / clone the repo
  2. try to preprocess any documents


**Expected behavior**
a vocab and traindataset to return