-
Great work! The idea is very interesting, and thank you for providing the code.
After running the script `download_models.sh`, I found out that there are several pretrained models in the folde…
-
I ran a split with a Spleeter pre-trained model on some Hindustani classical music recordings. Unfortunately, the source separation was too aggressive and the audio was suppressed in many cases but there …
-
**Is your feature request related to a problem? Please describe.**
I primarily work with audio data and it is particularly challenging to visualize different stages of audio data like `wavefo…
-
# Speech Separation
Speech separation is the task of obtaining clean, single-speaker speech from a speech mixture of multiple overlapping speakers.
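
The definition above can be illustrated with a minimal sketch of the signal model, assuming the mixture is a plain sample-wise sum of the speakers. Everything in it is hypothetical: the two sine tones stand in for speech, and `oracle_separate` cheats by using one reference source, which a real separation system never has access to.

```python
import math

SAMPLE_RATE = 8000  # Hz; arbitrary value chosen for this illustration


def tone(freq_hz, n_samples):
    """Return a sine tone that stands in for one speaker's signal."""
    return [
        math.sin(2 * math.pi * freq_hz * t / SAMPLE_RATE)
        for t in range(n_samples)
    ]


# Two "speakers" talking at the same time (toy stand-ins, not real speech).
speaker_a = tone(220.0, 400)
speaker_b = tone(330.0, 400)

# The observed mixture is assumed to be the sample-wise sum of the sources.
mixture = [a + b for a, b in zip(speaker_a, speaker_b)]


def oracle_separate(mix, known_source):
    """Recover the other source by subtracting a known one (oracle only)."""
    return [m - s for m, s in zip(mix, known_source)]


estimate_b = oracle_separate(mixture, speaker_a)

# Largest sample-wise deviation between the estimate and the true source.
max_error = max(abs(e - b) for e, b in zip(estimate_b, speaker_b))
print(f"max reconstruction error: {max_error:.2e}")
```

A real model has to estimate every source from `mixture` alone, which is what makes the task hard.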
## Task Objective
**Why is this task needed…
-
The real-time interpretation of underwater sounds can be improved by applying OSS machine learning and time-series techniques to streams of audio and, where available, the real-time output of other en…
-
It would be great to get support for Real, Imag, and Complex. Anybody working with audio data has to deal with complex inputs and this is very difficult without support for these ops.
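
For context, here is a minimal sketch of what these three ops compute on audio data. It uses plain Python rather than the framework this request targets, and the naive `dft` helper and the 8-sample frame are assumptions made for illustration only.

```python
import cmath
import math


def dft(frame):
    """Naive O(n^2) discrete Fourier transform of a real-valued frame."""
    n = len(frame)
    return [
        sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        for k in range(n)
    ]


# A short real-valued "audio" frame: one cosine cycle over 8 samples.
frame = [math.cos(2 * math.pi * t / 8) for t in range(8)]
spectrum = dft(frame)  # complex-valued frequency bins

# Real and Imag: split each complex bin into its two real-valued parts.
real_part = [z.real for z in spectrum]
imag_part = [z.imag for z in spectrum]

# Complex: rebuild the complex bins from the two real-valued parts.
rebuilt = [complex(re, im) for re, im in zip(real_part, imag_part)]

print(rebuilt == spectrum)
```

Without these ops, a pipeline has to carry the real and imaginary parts as two separate real tensors and reimplement complex arithmetic by hand.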
-
From your paper, I wasn't sure of the role/purpose of `music_speech_audioset_epoch_15_esc_89.98.pt`.
Are these the saved model weights one should use if one wants to focus on separation of musical ins…
-
I am testing ODAS for possible use in an interactive computer art piece, to isolate participants and performers giving voice commands from the ambient noise/music of the active space.
In my testing,…
-
What does the filter length represent?
-
Thank you for this great project! I frequently use the [Working With Local Files](http://naomiaro.github.io/waveform-playlist/newtracks.html) page to analyze and transcribe music created with a music …