Open NivaucchuRabuessa opened 1 year ago
Hello! I'm coming from your post on r/MachineLearning. Japanese transcriptions are more accurate with a voice activity detector (VAD), and that's the only reason I keep using a very simple WebUI. Do you have any plans to integrate one?
Links for reference:
VAD: https://github.com/snakers4/silero-vad
WebUI I'm currently using: https://github.com/openai/whisper/discussions/397
Will dig into this for the next update! Thanks!
Will integrate with PyAnnote. Bumping this.