red-bin / metadata_grapher

2 stars 1 forks source link

Munging & EDA in R #1

Closed dataders closed 5 years ago

dataders commented 5 years ago

Read the Excel files into one giant dataframe for cleaning. Added an index and file column for reference Added a department column (stripped from file name) For the columns sender, to, cc, bcc:

Resultant csv is uploading to Kaggle as wel speak.

red-bin commented 5 years ago

This will be very helpful for anyone who might want to approach the project in R!

When you get a chance, could you make a link to the kaggle dataset? That should then be put into a README. (I'll make an issue to create that).