magronp / bregmisi

Phase recovery with the Bregman divergence for audio source separation
6 stars 1 forks source link

Phase recovery with Bregman divergences for audio source separation

This repository contains the code for reproducing the experiments in our paper entitled Phase recovery with the Bregman divergence for audio source separation, published at the IEEE International Conference on Audio, Speech and Signal Processing (ICASSP) 2021.

Getting the data

After cloning or downloading this repository, you will need to get the speech and noise data to reproduce the results.

Note that you can change the folder structures, as long as you change the speech and noise directory paths accordingly in the code.

Then, simply execute the prepare_data.py script to create the noisy mixtures.

Getting the pre-trained model

To run the experiments, you will need to first estimate the spectrograms of the sources, which is done using the pytorch implementation of the Open Unmix model trained for a speech enhancement task.

The pre-trained model for estimating the speech and noise spectrograms is available here. You should place the .json and .pth files in the open_unmx/ folder. Note that you should also rename the .pth files simply as speech.pth and noise.pth.

Reproducing the experiments

Now that you're all set, simply run the following scripts:

Reference

If you use any of this code for your research, please cite our paper: ```latex @inproceedings{Magron2021, author={P. Magron and P.-H. Vial and T. Oberlin and C. F{\'e}votte}, title={Phase recovery with {B}regman divergences for audio source separation}, booktitle={Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}, year={2021}, month={June} } ```