Alina-enni / lingdiggers

Project for the Building NLP Applications course
0 stars 0 forks source link

Step 4 #12

Closed Alina-enni closed 2 years ago

Alina-enni commented 2 years ago
  1. Have you noticed that not all words in the toy data were actually indexed by the code in the tutorial? Which ones? Would you like to index all words containing alpha-numerical characters? Can you solve that? The answer can be found here: https://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.CountVectorizer.html (Again, don't waste time on this one if it is too difficult, but return to it later.)