soobinseo / wavenet

Audio source separation (mixture to vocal) using the Wavenet
source-separation tensorflow wavenet

wavenet

Description

Requirements

Data

I used the DSD100 dataset, which consists of pairs of mixture audio files and vocal audio files. The complete dataset (~14 GB) can be downloaded here. The data was pre-processed with sample_rate=16000 and divided into 380 ms units, so each network input has 6080 timesteps of raw audio (0.38 s × 16000 samples/s = 6080 samples).
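The segment arithmetic above can be sketched as follows. This is a minimal illustration and not the repository's actual pre-processing code: the names `split_into_segments` and `make_pairs` are hypothetical, the audio is assumed to be already loaded and resampled to 16 kHz as a 1-D NumPy array, and the choice of non-overlapping segments with the trailing remainder dropped is an assumption.

```python
import numpy as np

SAMPLE_RATE = 16000   # Hz, as stated in the README
SEGMENT_MS = 380      # segment duration in milliseconds
SEGMENT_LEN = SAMPLE_RATE * SEGMENT_MS // 1000  # 6080 samples per segment


def split_into_segments(audio, segment_len=SEGMENT_LEN):
    """Split a 1-D waveform into non-overlapping segments.

    Assumption: samples past the last full segment are discarded.
    Returns an array of shape (n_segments, segment_len).
    """
    audio = np.asarray(audio)
    n_segments = len(audio) // segment_len
    return audio[: n_segments * segment_len].reshape(n_segments, segment_len)


def make_pairs(mixture, vocal, segment_len=SEGMENT_LEN):
    """Build aligned (mixture, vocal) segment arrays (hypothetical helper).

    Both waveforms are truncated to the shorter length first so that
    segment i of the mixture lines up with segment i of the vocal track.
    """
    n = min(len(mixture), len(vocal))
    return (
        split_into_segments(mixture[:n], segment_len),
        split_into_segments(vocal[:n], segment_len),
    )
```

For example, two seconds of audio (32000 samples) yields five full segments of 6080 samples each, and the last 1600 samples are dropped.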

File description

Training the network

Generate separated vocal audio

Notes