edublancas / song-lyrics

Exploratory Analysis of 200K+ song lyrics from the 1 million songs dataset
https://blancas.io/song-lyrics/
MIT License
5 stars 1 forks source link

Data for interactive format #26

Closed valmikkpatel closed 6 years ago

valmikkpatel commented 6 years ago

I need the data with following fields for the interactive part. Let me know if you need clarification.

name - artist_name - "The Beatles" songs - number of songs - "42" genres - array with top 3 genres - ["Pop", "Rock", "Progressive"] year_st - start_year - "1956" year_end - end year - "1969" sent_score - average sentiment score for the artist - "4.3" top_words - dictionary of top 5 words and count - {“love”: 250, “hate”: 390,…}

edublancas commented 6 years ago

Just pushed the script to generate the dataset (updated the boostrap script as well).

The file is 8MB, perhaps we can subset artists with more than k songs