Open qzyyyy opened 1 month ago
@qzyyyy Hello there! I'm here to assist you with any bugs, questions, or contributions while you wait for a human maintainer. Let's tackle this together!
Yes, you can use VAD (Voice Activity Detection) in OpenVoice2. The get_se
function allows you to choose between VAD and Whisper for processing audio. By setting the vad
parameter to True
, the function will use VAD to segment the audio. However, keep in mind that OpenVoice2 filters out audio segments shorter than 1.5 seconds, so these segments will not be saved or used for voice cloning [1].
To continue talking to Dosu, mention @dosu.
Will VAD effect the performance of OpenVoice2?
I want to clone a voice from audio segments that are shorter than 1.5 seconds after splitting, and I'm wondering if I can use VAD instead of Whisper in OpenVoice2.