Create a class for preprocessing text data. Class should have methods for removing punctuation, adding stop characters, and fitting the text into different sized buckets. Class should also convert text vocabulary into integers, and output a list of sequences of these integers.
Run main function of preprocess.py. Function will take a few minutes to clean data, then will print a few lines of original data and cleaned counterparts.
Relies on having "twitter_text_emotion.csv" in a directory called "raw_data".
Create a class for preprocessing text data. Class should have methods for removing punctuation, adding stop characters, and fitting the text into different sized buckets. Class should also convert text vocabulary into integers, and output a list of sequences of these integers.