audeering / audtorch

Utils and data sets for audio and PyTorch
https://audeering.github.io/audtorch/
Other
83 stars 9 forks source link

Added Google Speech Commands V2 Dataset #49

Closed harritaylor closed 5 years ago

harritaylor commented 5 years ago

Summary

As title, added google speech commands dataset, with some choice designs for preprocessing as described in the next section.

Proposed Changes

Discussion

  1. silence: currently the DataSet simply points to samples in the _background_noise_ folder. These samples are 1min long, whereas the speech commands are 1sec long. My current workaround is to use RandomCrop with 16,000 samples, which deals with this issue nicely. I don't think it would be efficient to chop up and store up 1 second clips of the silence clips.
hagenw commented 5 years ago

Cool, thanks for your hard work. I added just one small comment.

And if you like, feel free to add your name in LICENSE under the Contributors: section.

harritaylor commented 5 years ago

Cool, thanks!