A powerful collection library for feature extraction and text cleaning using Unicode translations, regular expressions, natural language processing, large language models and more.
This PR is an initialization PR to merge the existing code into the NLPurify module. This is a part of the total required changes (#1), and the following changes and fixes are listed below.
This PR brings the following change(s):
:recycle: Legacy Code(s) are maintained under nlpurify/legacy unless all existing code are migrated.
Some of my codes are dependent on the existing structure. This should be made available unless all codes are migrated to the latest changes. Check issue details #5 for more information.
:scroll: Description
This PR is an initialization PR to merge the existing code into the
NLPurify
module. This is a part of the total required changes (#1), and the following changes and fixes are listed below.This PR brings the following change(s):
nlpurify/legacy
unless all existing code are migrated.nltk.stopwords
,lemmatizations
.fuzzywuzzy
.On successful merge, the PR Fixes #2 Fixes #4 Fixes #5