gitmylo / audio-webui

A webui for different audio related Neural Networks
MIT License
1.01k stars 94 forks source link

[QUESTION] Speaker Diarization #82

Open slavakurilyak opened 1 year ago

slavakurilyak commented 1 year ago

Speaker Diarization

Any plans to integrate a speaker diarization library, like pyannote-audio by @pyannote?

gitmylo commented 1 year ago

Good idea! This doesn't seem like something that will take long to implement, however, outputs will be as files probably, because gradio audio elements does not support an unknown amount of outputs, and you cannot add elements in gradio after you've already launched your app. Sadly. So a file output will have to do, outputting one audio file per speaker, or one audio file per separated speaker piece. It will be an option.