Open iver56 opened 4 years ago
Related question: we should probably include STFT in the deterministic transforms. Should we support torchaudio's and asteroid's?
Sounds good to me
Thinking out loud: would it make sense to make no API distinction between feature extraction and augmentation? Spectrogram extraction could be seen as an augmentation of audio.
It would allow an API like that:
augmenter = Chain(AddBackgroundNoise(), Spectrogram(), SpecAugment())
augmented_spectrogram = augmenter(audio)
That's what I have in mind at least. Everything is a transform, and augmentations are just probabilistic ones.
Sounds good to me
Cool. So I'll think about outsourcing asteroid's filterbank API to another repo then.
E.g. transforms like this: https://github.com/zcaceres/spec_augment