clanghout / twitch-classification

mmsr twitch genre classification based on chat

Extract database into csv #5

Closed mpasterkamp closed 7 years ago

mpasterkamp commented 7 years ago

Extract the messages from the PostgreSQL database, assign genre labels based on the label_dictionary file, and save the result to a CSV file.
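A rough sketch of what the labelling and CSV step could look like. The database query is left out so the snippet runs on its own. Everything here is an assumption for illustration: the `(channel, message)` row shape, the idea that label_dictionary maps a channel name to a genre, the column names, and the helper names `label_messages` and `write_csv` are all hypothetical.

```python
import csv
import io


def label_messages(rows, label_dictionary):
    """Yield (channel, genre, message) for every row whose channel has a label.

    `rows` is any iterable of (channel, message) pairs, e.g. the result of a
    database cursor. `label_dictionary` is assumed to map channel -> genre.
    """
    for channel, message in rows:
        genre = label_dictionary.get(channel)
        if genre is None:
            continue  # no genre known for this channel, so skip the message
        yield channel, genre, message


def write_csv(labeled_rows, out):
    """Write a header plus the labelled rows to the open text file `out`."""
    writer = csv.writer(out)
    writer.writerow(["channel", "genre", "message"])
    writer.writerows(labeled_rows)


# Stand-in data; in the real script these rows would come from the database.
rows = [
    ("channel_a", "hello chat"),
    ("channel_b", "gg"),
    ("channel_c", "message from an unlabelled channel"),
]
label_dictionary = {"channel_a": "fps", "channel_b": "moba"}

buffer = io.StringIO()
write_csv(label_messages(rows, label_dictionary), buffer)
print(buffer.getvalue())
```

To write an actual file, something like `open("data.csv", "w", newline="")` could be passed instead of the in-memory buffer. Because `label_messages` is a generator, rows are streamed one at a time rather than held in memory.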

Keraito commented 7 years ago

Just an FYI: my data.csv went from ~443 KB (yours) to around 28.6 MB.

mpasterkamp commented 7 years ago

I did not expect the file sizes to become this big. I guess we will have to find another way to share the data and gitignore the data files.