shahules786 / mayavoz

Pytorch based speech enhancement toolkit.
MIT License
328 stars 21 forks source link
audio-enhancement deep-learning denoiser pretrained-models python pytorch speech-enhancement

GitHub Workflow Status GitHub GitHub issues GitHub Repo stars

mayavoz is a Pytorch-based opensource toolkit for speech enhancement. It is designed to save time for audio practioners & researchers. It provides easy to use pretrained speech enhancement models and facilitates highly customisable model training.

| Quick Start | Installation | Tutorials | Available Recipes | Demo

Key features :key:

Demo

Noisy speech followed by enhanced version.

https://user-images.githubusercontent.com/25312635/203756185-737557f4-6e21-4146-aa2c-95da69d0de4c.mp4

Quick Start :fire:

from mayavoz.models import Mayamodel

model = Mayamodel.from_pretrained("shahules786/mayavoz-waveunet-valentini-28spk")
model.enhance("noisy_audio.wav")

Recipes

Model Dataset STOI PESQ URL
WaveUnet Valentini-28spk 0.836 2.78 shahules786/mayavoz-waveunet-valentini-28spk
Demucs Valentini-28spk 0.961 2.56 shahules786/mayavoz-demucs-valentini-28spk
DCCRN Valentini-28spk 0.724 2.55 shahules786/mayavoz-dccrn-valentini-28spk
Demucs MS-SNSD-20hrs 0.56 1.26 shahules786/mayavoz-demucs-ms-snsd-20

Test scores are based on respective test set associated with train dataset.

See tutorials to train your custom model

Installation

Only Python 3.8+ is officially supported (though it might work with Python 3.7)

conda env create -f environment.yml
conda activate mayavoz

Support

For commercial enquiries and scientific consulting, please contact me.

Acknowledgements

Sincere gratitude to AMPLYFI for supporting this project.