kyungyunlee / ismir2018-revisiting-svd

Revisiting Singing Voice Detection : a Quantitative Review and the Future Outlook
66 stars 9 forks source link
deep-learning ismir machine-learning music-information-retrieval singing-voice

Revisiting Singing Voice Detection : a quantitative review and the future outlook

This repo contains code for the paper "Revisiting Singing Voice Detection: a Quantitative Review and the Future Outlook" by Kyungyun Lee, Keunwoo Choi and Juhan Nam at the 19th International Society for Music Information Retrieval Conference (ISMIR) 2018. [pdf, blog post]

Requirements

Public Dataset

Dataset for stress testing (section 5)

To generate dataset, run

Reproduction of singing voice detection models (section 3)

There are 3 reproduced models in the following folders :

Commandline arguments are :

In each model folder, audio processor to preprocess data must be run before playing around with the model.

To train models, run the following in each model folder

References

TO DO (2018.06)