Closed turian closed 1 year ago
I'm curious if you could release a pretrained model with a much shorter receptive length?
This would be useful for fine-grained tasks, like music transcription and event transcription (with a smaller hop size)
Noticing that the convolution does this, and you can just remove the averaging
I'm curious if you could release a pretrained model with a much shorter receptive length?
This would be useful for fine-grained tasks, like music transcription and event transcription (with a smaller hop size)