Closed by sachinravi14 6 years ago
Hi, I just went through the papers in the reading list. Regarding the lyrics dataset, I was wondering whether we could crawl lyrics from some website.
Hi, I have been working on lyrics generation for a while. I am gathering lyrics data from the web and adding it to a GitHub repository (https://github.com/MohMehKo/lyrics/tree/master/artist_songs). Let me know if it is a good starting point.
See the musicmood project by Sebastian Raschka: https://github.com/rasbt/musicmood
In that project, data collection is done by taking songs from the Million Song Dataset and then scraping the lyrics from LyricWikia; more details here and a demonstration here.
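To make the scraping step concrete, here is a minimal sketch of pulling lyric text out of an already-downloaded page using only the Python standard library. It assumes the lyrics sit inside a `<div class="lyricbox">` element with lines separated by `<br>` tags; that class name, and the names `LyricBoxParser` and `extract_lyrics`, are assumptions for illustration and are not taken from the musicmood code.

```python
from html.parser import HTMLParser


class LyricBoxParser(HTMLParser):
    """Collect the text inside the first <div class="lyricbox"> element.

    The "lyricbox" class name is an assumption about the page markup.
    """

    def __init__(self):
        super().__init__()
        self.depth = 0      # <div> nesting depth inside the lyric box
        self.done = False   # set once the first lyric box has been closed
        self.parts = []     # collected text fragments and line breaks

    def handle_starttag(self, tag, attrs):
        if tag == "div":
            classes = (dict(attrs).get("class") or "").split()
            if self.depth > 0:
                self.depth += 1
            elif "lyricbox" in classes and not self.done:
                self.depth = 1
        elif tag == "br" and self.depth > 0:
            # Treat <br> (and <br/>) as a line break in the lyrics.
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag == "div" and self.depth > 0:
            self.depth -= 1
            if self.depth == 0:
                self.done = True

    def handle_data(self, data):
        if self.depth > 0:
            self.parts.append(data)


def extract_lyrics(page_html):
    """Return the lyric lines as newline-separated text (blank lines dropped)."""
    parser = LyricBoxParser()
    parser.feed(page_html)
    parser.close()
    lines = [line.strip() for line in "".join(parser.parts).splitlines()]
    return "\n".join(line for line in lines if line)
```

Dropping blank lines also discards stanza breaks, so keep them if the structure of the song matters for the dataset.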
I second @heaven00's idea. We will need to convert the track to MIDI format.
Looks like the Lakh MIDI dataset has lyrics attached to ~23800 MIDI files, which might be useful. I haven't looked at the quality of the dataset yet though.
edit: link
@korjani, the repository you linked to looks great! Can you give details about how the dataset was created?
These lyrics data sources could also be used as a good starting point.
Details of the requirements for the dataset can be found in the proposal. We are looking for suggestions for creating this dataset, including: