Open ghost opened 3 years ago
Why not use the hey-snips negative samples?
Right now, I'm using the positive samples to get FRR, and negative samples for FAR, both from the hey-snips dataset. This question was just for FRR, but do you mean we should be passing both negative and positive samples for FRR?
For FAH, concatenation should be fine, but for positive samples you should be careful: you might get multiple positive results around one occurrence of "hey snips".
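For reference, a rough sketch of the FAH computation on a concatenated negative stream, assuming FAH means false alarms per hour and that you already have a count of false detections and the total audio duration. `false_alarms_per_hour` is a hypothetical helper name, not something from an existing toolkit.

```python
def false_alarms_per_hour(num_false_alarms, total_audio_seconds):
    """Return false alarms per hour of negative audio.

    Assumption: every detection on negative-only audio is a false alarm.
    """
    if total_audio_seconds <= 0:
        raise ValueError("total_audio_seconds must be positive")
    hours = total_audio_seconds / 3600.0
    return num_false_alarms / hours


# Example: 3 false alarms over 2 hours of concatenated negative audio.
fah = false_alarms_per_hour(3, 2 * 3600)
print(fah)
```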
I did notice this! I'm accounting for it by checking whether the previous posterior was above the threshold; if so, I count the run as a single positive result. I also added three seconds of padding between samples, which should be enough for this procedure to work. I tested on a small set, and it seemed to work well. But please let me know if you have any suggestions for handling this better.
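Roughly, the counting rule looks like the sketch below. It assumes the model emits one posterior per frame as a plain sequence of floats; `count_detections` is a hypothetical name used here for illustration, not code from the actual evaluation script.

```python
def count_detections(posteriors, threshold):
    """Count runs of consecutive posteriors above the threshold.

    A detection is only counted when the posterior crosses from
    at-or-below the threshold to above it, so a burst of high
    posteriors around one wakeword occurrence counts once.
    """
    count = 0
    prev_above = False
    for p in posteriors:
        above = p > threshold
        if above and not prev_above:
            count += 1
        prev_above = above
    return count


# Two separate bursts above 0.5, so two detections are counted.
frames = [0.1, 0.9, 0.95, 0.2, 0.1, 0.8, 0.1]
print(count_detections(frames, 0.5))
```

Note that a posterior that dips below the threshold for a single frame inside one utterance would still be counted twice by this rule, so a minimum gap between detections may be worth adding.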
One simple solution is to skip concatenation and pass the separate segments one by one.
I'm working on the script to evaluate our wakeword models, and my current approach to calculate FRR is:
This all seems well and good, and it's clear that we can then adjust the posterior threshold to find the appropriate setting for our desired FRR (or sweep over for evaluation), but thus far the model is not detecting any wakewords using this pipeline. It definitely does when I speak into the microphone, so I'm wondering if this is the best way to go about testing.
Do any of you have thoughts or references I could check out to guide the process?
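To make the threshold sweep concrete, here is a minimal sketch. It assumes each positive clip is scored separately and reduced to its maximum posterior, and that a clip counts as rejected when that maximum does not exceed the threshold. `frr_at_threshold` and `sweep_frr` are hypothetical helper names, and the scores are made-up illustration values, not results.

```python
def frr_at_threshold(max_posteriors, threshold):
    """Fraction of positive clips whose best posterior misses the threshold."""
    if not max_posteriors:
        raise ValueError("need at least one positive clip")
    misses = sum(1 for p in max_posteriors if p <= threshold)
    return misses / len(max_posteriors)


def sweep_frr(max_posteriors, thresholds):
    """Return (threshold, FRR) pairs for each candidate threshold."""
    return [(t, frr_at_threshold(max_posteriors, t)) for t in thresholds]


# Illustration only: per-clip maximum posteriors for four positive clips.
clip_scores = [0.9, 0.4, 0.7, 0.2]
for threshold, frr in sweep_frr(clip_scores, [0.1, 0.5, 0.8]):
    print(threshold, frr)
```

If every clip scores near zero in a sweep like this, the problem is more likely in the input pipeline (sample rate, normalization, feature extraction) than in the threshold choice.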