Open thehomebrewnerd opened 2 years ago
The LSA primitive applies a cleaning step that removes stop words. Currently this is hard-coded to remove English stop words:
swords = set(nltk.corpus.stopwords.words("english"))
The primitive should be updated to allow users to specify other languages that are supported by nltk so the primitive functions properly on natural language columns that are not in English.
The LSA primitive applies a cleaning step that removes stop words. Currently this is hard-coded to remove English stop words:
The primitive should be updated to allow users to specify other languages that are supported by nltk so the primitive functions properly on natural language columns that are not in English.