ludovicaschaerf / TMCI_Project

0 stars 0 forks source link

Update 2 #2

Open ludovicaschaerf opened 4 years ago

ludovicaschaerf commented 4 years ago

We are now ready to start with the topic modelling, which we are waiting for next class' explanation to implement. Currently, we have songs from 5 artists (we added 3 new ones) and we have a column (added to the original dataframe) that contains the bag of words corresponding to each lyric and including the 20 most popular bigrams. As we talked about in class, we stopped following the tutorial and the current code is all programmed by us.

Giovanni1085 commented 4 years ago

Next lab is public now, if helpful: https://github.com/Giovanni1085/AUC_TMCI_2019/blob/master/notebooks/13_Clustering_TopicModelling.ipynb.