Closed wanglong100 closed 3 years ago
Hi @DumBoFly ,
The original thought was to get more resolution in the higher frequency bands. The MFCC features have the largest dynamic range in the first and second bands (the lower frequencies). If we don't saturate them, the higher-frequency bands end up as much smaller numbers after scaling, so they keep fewer quantization levels.
You can remove the saturation to test whether it makes any difference, but remember to do the same in the C code.
A similar operation is also implemented in the RNN denoise example.
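The saturation effect described above can be sketched with a few numbers. This is only an illustration: the coefficient values are made up (not real MFCC output), and `scale_and_quantize` is a hypothetical helper that mirrors the clip, scale, and round-to-1/128 steps of `normalize` in `kws.py`.

```python
import numpy as np

def scale_and_quantize(data, limit):
    """Clip to [-limit, limit], scale to [-1, 1], round to steps of 1/128."""
    scaled = np.clip(data, -limit, limit) / limit
    return np.round(scaled * 128) / 128.0

# Made-up coefficients: two large low-frequency bands, small higher bands.
frame = np.array([-60.0, 20.0, 3.0, -2.5, 1.2, -0.8, 0.5, -0.3])

# Without saturation the limit has to cover the largest value (60 here).
no_saturation = scale_and_quantize(frame, limit=np.abs(frame).max())
# With saturation at 2**3 = 8 the two low bands are clipped.
saturated = scale_and_quantize(frame, limit=8.0)

# Size of one quantization step, expressed in the original feature units.
step_no_saturation = np.abs(frame).max() / 128  # 60 / 128 = 0.46875
step_saturated = 8.0 / 128                      # 8 / 128 = 0.0625

# Integer codes used by the higher bands (index 2 onwards) in each case.
high_no_sat = no_saturation[2:] * 128
high_sat = saturated[2:] * 128
print("codes without saturation:", high_no_sat)
print("codes with saturation:   ", high_sat)
```

With these made-up values, the higher bands span only a handful of integer codes when the limit follows the largest coefficient, and a much wider span once the low bands are saturated at 8. The cost is that the first two bands lose all information beyond the limit.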
Thanks,
Sorry for the late reply. Thank you for your explanation; I think I have got it.
Hi jianjia,
I see the code snippet below in the file (https://github.com/majianjia/nnom/blob/master/examples/keyword_spotting/kws.py):

```python
def normalize(data, n, quantize=True):
    limit = pow(2, n)
    data = np.clip(data, -limit, limit) / limit
    if quantize:
        data = np.round(data * 128) / 128.0
    return data
```
You selected [-8, 8] as the range for the dataset.
I wonder how you decide the feature range for a dataset. Thanks.
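One way to explore this question is to tabulate, for several candidate values of `n`, how many feature values would be clipped and how coarse the quantization step becomes. The sketch below is an assumption for illustration, not the method used in `kws.py`: `clip_report` is a hypothetical helper, and the feature matrix is synthetic random data standing in for real MFCC frames.

```python
import numpy as np

def clip_report(features, n_values):
    """For each n, return (n, fraction of values outside [-2**n, 2**n],
    quantization step in feature units when rounding to 1/128)."""
    report = []
    for n in n_values:
        limit = 2 ** n
        clipped_fraction = float(np.mean(np.abs(features) > limit))
        step = limit / 128
        report.append((n, clipped_fraction, step))
    return report

# Synthetic stand-in for a (frames x bands) feature matrix, NOT real MFCC data:
# the first two bands get a much larger spread than the others.
rng = np.random.default_rng(0)
features = rng.normal(0.0, 2.0, size=(1000, 13))
features[:, :2] *= 15.0

report = clip_report(features, n_values=range(1, 7))
for n, clipped_fraction, step in report:
    print(f"n={n}  limit={2 ** n:3d}  clipped={clipped_fraction:.3f}  step={step:.4f}")
```

A larger `n` clips fewer values but makes each step coarser, so the higher bands get less resolution. Running the same report on the real training features would show where that trade-off sits for a given dataset.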