cse442-fall-2019-offering / 442projects-team-rydat

442projects-team-rydat created by GitHub Classroom

Preprocess Class #37

Open dakota0064 opened 5 years ago

dakota0064 commented 5 years ago

Create a class for preprocessing text data. The class should have methods for removing punctuation, adding stop characters, and fitting the text into buckets of different sizes. It should also convert the text vocabulary into integers and output a list of integer sequences.
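A rough sketch of what such a class could look like, to make the requirements above concrete. Everything here is an assumption for illustration and not the team's actual design: the class name `Preprocessor`, the method names, the `<eos>` stop token, the `<pad>` index of 0, the bucket sizes, and the choice to truncate sequences longer than the largest bucket.

```python
import string
from typing import Dict, List, Sequence


class Preprocessor:
    """Hypothetical sketch of the preprocessing class described in this issue."""

    def __init__(self, stop_token: str = "<eos>",
                 bucket_sizes: Sequence[int] = (5, 10, 20)):
        self.stop_token = stop_token            # assumed stop marker
        self.bucket_sizes = sorted(bucket_sizes)
        self.vocab: Dict[str, int] = {"<pad>": 0}  # index 0 reserved for padding

    def remove_punctuation(self, text: str) -> str:
        # Drop every ASCII punctuation character.
        return text.translate(str.maketrans("", "", string.punctuation))

    def tokenize(self, text: str) -> List[str]:
        # Lowercase, strip punctuation, split on whitespace, append the stop token.
        return self.remove_punctuation(text.lower()).split() + [self.stop_token]

    def encode(self, tokens: List[str]) -> List[int]:
        # Assign each new token the next free integer id.
        ids = []
        for token in tokens:
            if token not in self.vocab:
                self.vocab[token] = len(self.vocab)
            ids.append(self.vocab[token])
        return ids

    def bucket(self, ids: List[int]) -> List[int]:
        # Pad with 0 up to the smallest bucket that fits the sequence.
        for size in self.bucket_sizes:
            if len(ids) <= size:
                return ids + [0] * (size - len(ids))
        # Too long for any bucket: truncate (this drops the stop token).
        return ids[: self.bucket_sizes[-1]]

    def transform(self, texts: Sequence[str]) -> List[List[int]]:
        # Full pipeline: one bucketed integer sequence per input text.
        return [self.bucket(self.encode(self.tokenize(t))) for t in texts]


pre = Preprocessor(bucket_sizes=(4, 8))
sequences = pre.transform(["Hello, world!", "I am so happy today, really!"])
print(sequences[0])  # [1, 2, 3, 0]
```

In this sketch the vocabulary grows as text is seen. A real implementation might instead fit the vocabulary once on training data and map unseen words to an unknown-word id.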

dakota0064 commented 4 years ago

Task Test:

The test relies on a file named "twitter_text_emotion.csv" being present in a directory called "raw_data".