Closed kradonneoh closed 1 year ago
Hi!
For your questions:
Hope that helps!
Closing for now since this seems inactive, feel free to open if any more questions.
Thanks for response! I did have a few more questions about training and implementation:
(I couldn't find a way to re-open the issue, so I'm hoping you'll still get a notification for this)
Ahh sure thing:
Hope that helps!
Closing again since it seems inactive, feel free to open if any more questions.
Hey!
I had a few questions regarding the choices made when designing the ConvGRU network and getting your thoughts extensions to the dataset.
For the ConvGRU network, why did you decide to go with raw waveforms as opposed to Log Scale Mel Spectrograms (which often seems to be the first choice for convolutional style embedding networks)? Did you experiment with both and find the raw waveforms to be better? Also, did you ever try a fully convolutional approach / with a transformer or self-attention blocks?
In terms of the dataset, did you ever try multi-lingual data in addition to english data? I'm wondering if the addition of content that isn't english will help the model ignore content more than it does already