about data prepare - Githubissues

MycroftAI / mycroft-precise

A lightweight, simple-to-use, RNN wake word listener

Apache License 2.0

853 stars 229 forks source link

about data prepare #97

Open garymmi opened 5 years ago

garymmi commented 5 years ago

I read the doc https://github.com/MycroftAI/mycroft-precise/wiki/Training-your-own-wake-word#how-to-train-your-own-wake-word/ It said making sure to leave a second or two of silence at the start of each recording, but with no silence after the wake word.

it means i can leave one second or two second of silence at beginning of a recoding? if yes and leave two second, when getting mfcc, only processed 1.5 sec audio file, strange.
if leave one second, at beginning of a recoding, it means only 0.5 sec is wake up word ?
how to process lots of recodings to prepare training data? should i do VAD and then padding silence at begin and cut silence at end?

MatthewScholefield commented 5 years ago

Data is processed as follows:

If longer than 1.5 seconds: chop to 1.5 seconds
If shorter than 1.5 seconds: add perfect silence in beginning to make it 1.5 seconds

Since in reality there's static in microphones, when recordings are too short and it inserts perfect silence that could degrade the real world performance of the model. Having too much audio before the wake word should never be a problem since it chops it to the correct size.

Let me know if this males sense

garymmi commented 5 years ago

Data is processed as follows:

If longer than 1.5 seconds: chop to 1.5 seconds

chop from start of the recording ? it means wake up word must exist between 0~1.5 sec, > 1.5 sec will be cut off, right ?

MatthewScholefield commented 5 years ago

No, the last 1.5 seconds of audio are kept and the beginning of the audio is removed.

garymmi commented 5 years ago

if i want to shorten 1.5 second, what codes need to be modified? Thank you

MatthewScholefield commented 5 years ago

Change buffer_t=1.5 in precise/params.py

On Thu, Oct 31, 2019, 12:28 AM garymmi notifications@github.com wrote:

if i want to shorten 1.5 second, what codes need to be modified? Thank you

— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/MycroftAI/mycroft-precise/issues/97?email_source=notifications&email_token=ABM2KSZPJ5U3XZJ5HBGVXATQRJUITA5CNFSM4JGUQMUKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOECWTLHI#issuecomment-548222365, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABM2KS3AOYA336JWMZR5DITQRJUITANCNFSM4JGUQMUA .