Closed jhkonan closed 2 years ago
During the training, there is an additional drop_band
function will be performed. Since the adjacent sub-band features are similar, during the training of the sub-band model, the FullSubNet will reasonably drop a half in sub-band features. It aims to speed up training without significant performance degradation. Check here for more details.
If you want to hack training, change num_groups_in_drop_band
(here) to 1
is the simplest method to archive the dimension you desired.
Thank you for the swift response. This does the trick, but you are right about the speed -- I need to half my batch size and training takes twice as long. Could we instead use duplicates of the sub-band features during training? Or, maybe get the enhancement using a lower resolution noisy?
I will try to understand this better while training using your suggestion. I appreciate your help.
During training, with batch size 10, we observe the following shapes:
However, during validation, we see:
Why is dimension 1 and 2 of the cRM different during training but not during validation?
Without these, I am unable to get the enhanced waveform during training, since this calculation fails: