antiboredom / videogrep

automatic video supercuts with python
https://antiboredom.github.io/videogrep
Other
3.34k stars 256 forks source link

Ability to recognize/choose speakers? #92

Open rwfeather opened 2 years ago

rwfeather commented 2 years ago

Would it be feasible to choose clips only from a certain speaker (or set of speakers)? Not sure what the workflow would be like.

Vosk supports doing speaker recognition (example here). I haven't tried it yet, so not sure if it's accurate enough for this purpose.

antiboredom commented 2 years ago

I'd be super interested in implementing this, or at least playing around to see if it's viable!