Closed HollowPrincess closed 5 years ago
Стемминг и лемматизация https://www.nltk.org/_modules/nltk/stem/snowball.html https://stackoverflow.com/questions/36182502/add-stemming-support-to-countvectorizer-sklearn https://www.programcreek.com/python/example/91271/nltk.stem
Обработка опечаток ? Нужно ли при 3-граммах? https://habr.com/ru/post/346618/
Бринк с.208
https://towardsdatascience.com/hacking-scikit-learns-vectorizers-9ef26a7170af