Closed VadimOvchinnikovA closed 4 years ago
mel is of shape (n_frames, batch) and the batch dimension is sorted by decreasing mel length. this guarantees that the first sample will not be padded, hence there are no padded zeros to send back to the end.
Heads up that we changed the code to cover cases where the batch with the larger text is does not have the longest mel.
I can't understand. Why are you iterating from 1 in forward method of AR_Back_Step?