hcmlab / vadnet

Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks
http://openssi.net
GNU Lesser General Public License v3.0
419 stars 77 forks

About frame size #21

Open loretoparisi opened 4 years ago

loretoparisi commented 4 years ago

Thanks for this project, @frankenjoe. I have noticed that the frame size is set to 1 sec. So for this file:

[0 0 0 0 0 0 1 0 0 0 1 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0 0 0 0 0 1 1 1 1 0 0 1
 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
 1 1 1 1 1 1 1 1 1 1 0 0 1 1 0 0 0 1 0 1 0 0 1 0 1 1 1 1 1 1 1 1 0 1 1 1 1
 1 0 0 1 1 0 0 0 0 0 0 1 1 1 0 0 1 1 1 1 1 0 1 1 1 1 1 0 1 0 1 1 0 0 1 0 0
 1 1 1 1 0 0 0 0 0]

I get 194 noise/speech labels, since the file is 194 seconds long. Is it possible to set an arbitrary frame size, specifically one shorter than the 1 sec reference frame, such as 0.25 sec?
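To make the arithmetic concrete, here is a minimal sketch of how the label count would scale with the frame size, assuming non-overlapping frames. It is not vadnet code: `num_frames` and `frame_interval` are hypothetical helper names used only for illustration.

```python
import math


def num_frames(duration_sec: float, frame_sec: float) -> int:
    """Number of non-overlapping frames needed to cover the whole clip."""
    if frame_sec <= 0:
        raise ValueError("frame_sec must be positive")
    return math.ceil(duration_sec / frame_sec)


def frame_interval(index: int, frame_sec: float) -> tuple:
    """Start and end time (in seconds) covered by the label at `index`."""
    return (index * frame_sec, (index + 1) * frame_sec)


# Assumption: a 194-second file, as in the output above.
print(num_frames(194, 1.0))     # labels with 1 sec frames
print(num_frames(194, 0.25))    # labels with 0.25 sec frames
print(frame_interval(6, 0.25))  # time span of the 7th label at 0.25 sec
```

With a 0.25 sec frame the same file would give four times as many labels, each covering a quarter of a second.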

Thank you.