ApayRus closed this issue 3 years ago
@Aparus you can tweak the VAD included in aeneas:
and also the way the boundaries are set:
but the raw truth is that the VAD included in aeneas is very rough (just compares the spectral energy). The VAD included in Audacity probably works better because it implements a better algorithm.
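To make "just compares the energy" concrete, here is a minimal sketch of an energy-threshold VAD. It is not the code in aeneas' vad.py: the function name `energy_vad` and the parameters `frame_ms`, `threshold_db` and `min_nonspeech_frames` are assumptions chosen for illustration only.

```python
import math


def energy_vad(samples, sample_rate, frame_ms=40, threshold_db=-35.0,
               min_nonspeech_frames=5):
    """Return (start_sec, end_sec) speech intervals for mono float samples.

    Illustrative sketch only: a frame counts as speech when its mean power,
    expressed in dB, exceeds threshold_db.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    n_frames = len(samples) // frame_len

    # 1. Label each frame by comparing its energy against the threshold.
    is_speech = []
    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        power = sum(x * x for x in frame) / frame_len
        level_db = 10 * math.log10(power + 1e-12)  # epsilon avoids log(0)
        is_speech.append(level_db > threshold_db)

    # 2. Relabel nonspeech runs shorter than min_nonspeech_frames as speech,
    #    but only when they sit between two speech frames.
    i = 0
    while i < n_frames:
        if is_speech[i]:
            i += 1
            continue
        j = i
        while j < n_frames and not is_speech[j]:
            j += 1
        if i > 0 and j < n_frames and (j - i) < min_nonspeech_frames:
            for k in range(i, j):
                is_speech[k] = True
        i = j

    # 3. Merge consecutive speech frames into intervals, in seconds.
    intervals = []
    start = None
    for i, flag in enumerate(is_speech):
        if flag and start is None:
            start = i
        elif not flag and start is not None:
            intervals.append((start * frame_len / sample_rate,
                              i * frame_len / sample_rate))
            start = None
    if start is not None:
        intervals.append((start * frame_len / sample_rate,
                          n_frames * frame_len / sample_rate))
    return intervals
```

In a scheme like this, a threshold that is set too high labels quiet syllables as nonspeech, which would look like the VAD "eating" parts of the speech; lowering the threshold or padding the detected intervals are the usual knobs.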
Currently there is no way of hooking in a different VAD implementation; you would need to run aeneas from source (e.g., from an editable installation) and change vad.py yourself.
In the past, I tried the WebRTC VAD via https://github.com/wiseman/py-webrtcvad but it has some limitations/problems, so I did not integrate it in aeneas "open source".
I might consider supporting a better VAD or even allowing users to hook in custom VADs in aeneas 2.0.0, but that will not happen any time soon.
Hi! Thank you for your great library. I love it and have been using it for the last few weeks! :)
I ran vad and got a file with speech, nonspeech intervals. Then I visualized it in Audacity via "import --> labels" (track 1). I also ran Audacity "analyze --> sound finder" with different params and got tracks 2 and 3.
As we can see, Aeneas/vad eats parts of the speech, but Audacity/sound finder doesn't, and works properly. Is there a way to change the Aeneas/vad parameters, the way we can change them in Audacity/sound finder?
If you ask me why I need this: I want to add gaps between phrases. Aeneas marks phrases in this way (without gaps):
I need phrases in this way (with gaps):
You can also take a look at my site, where I implement all these things: frazy.me
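One way to get the "with gaps" layout described above is to post-process the aligned phrases against a list of speech intervals, whichever VAD produced them. The following is only a sketch under assumptions: `add_gaps` and `to_audacity_labels` are hypothetical helper names, not part of aeneas, and phrases are assumed to be (begin, end, text) tuples in seconds. The label output uses Audacity's tab-separated start, end, label text format.

```python
def add_gaps(fragments, speech):
    """Shrink each (begin, end, text) fragment to the speech it contains.

    fragments: contiguous phrases, e.g. taken from an alignment sync map.
    speech:    (begin, end) speech intervals sorted by time, from any VAD.
    Hypothetical helper for illustration; fragments that overlap no speech
    interval are returned unchanged.
    """
    result = []
    for begin, end, text in fragments:
        # Clip every overlapping speech interval to the fragment's bounds.
        overlaps = [(max(begin, s), min(end, e))
                    for s, e in speech if s < end and e > begin]
        if overlaps:
            begin, end = overlaps[0][0], overlaps[-1][1]
        result.append((begin, end, text))
    return result


def to_audacity_labels(fragments):
    """Format fragments as an Audacity label track: start<TAB>end<TAB>label."""
    return "\n".join(f"{b:.6f}\t{e:.6f}\t{t}" for b, e, t in fragments)


phrases = [(0.0, 2.0, "a"), (2.0, 5.0, "b")]        # contiguous, no gaps
speech = [(0.5, 1.5), (2.4, 3.0), (3.2, 4.6)]       # illustrative VAD output
gapped = add_gaps(phrases, speech)
print(to_audacity_labels(gapped))
```

The printed text can be saved to a file and loaded in Audacity with the same "import --> labels" step, so the trimmed phrases can be compared visually against the other tracks. Pauses inside a phrase are kept; only the leading and trailing nonspeech is cut away.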