Closed daniel-deychakiwsky closed 8 months ago
@daniel-deychakiwsky I can't quite remember now, but there was likely no significant reason.
We reduced dimensions by using strides instead of pooling. Input shape was not square (T x F spectrogram, where T is not F), and continuously using [(1,2), (2,1)] would waste parameters.
Hey team! I'm looking into the model definition here.
Should
be
Or is the former intended and, if so, why?