Nthiki / nlp-sdg


Twitter cleaning data and sentiment analysis #23

Closed Luka-Explore closed 2 years ago

Luka-Explore commented 2 years ago

This adds a function that takes a Twitter dataset (as a pandas DataFrame), removes unwanted strings and stop words from the text column, stores the result in a new column called 'clean_text', and saves the data in the intermediate layer.

I checked it by adding a toy Twitter dataset in the same format as the data team B is scraping.
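
For readers without the diff at hand, the cleaning step described above might look roughly like the sketch below. This is not the code from this PR: the function names `clean_tweet` and `add_clean_text`, the tiny `STOP_WORDS` set, the regular expressions, and the input column name `text` are all assumptions made for illustration. Saving to the intermediate layer is left out because the comment does not say how it is done.

```python
import re

import pandas as pd

# Hypothetical, deliberately small stop-word list (an assumption for this
# sketch); a real pipeline would likely use a fuller list.
STOP_WORDS = {"a", "an", "and", "the", "is", "in", "of", "to", "rt"}

# Assumed definition of "unwanted strings": URLs, @mentions, and any
# character that is not a lowercase letter or whitespace.
URL_OR_MENTION = re.compile(r"https?://\S+|@\w+")
NON_LETTERS = re.compile(r"[^a-z\s]")


def clean_tweet(text: str) -> str:
    """Lowercase a tweet, strip URLs/mentions/punctuation, drop stop words."""
    text = URL_OR_MENTION.sub(" ", text.lower())
    text = NON_LETTERS.sub(" ", text)
    return " ".join(word for word in text.split() if word not in STOP_WORDS)


def add_clean_text(df: pd.DataFrame, text_col: str = "text") -> pd.DataFrame:
    """Return a copy of df with a 'clean_text' column derived from text_col."""
    out = df.copy()  # leave the caller's DataFrame untouched
    out["clean_text"] = out[text_col].astype(str).map(clean_tweet)
    return out


# Toy data in an assumed format: a single 'text' column of raw tweets.
toy = pd.DataFrame(
    {"text": ["RT @user: The SDGs are GREAT! https://t.co/abc #climate"]}
)
cleaned = add_clean_text(toy)
print(cleaned["clean_text"].iloc[0])
```

Returning a copy rather than modifying the input keeps the raw text available for comparison when checking the result against a toy dataset.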