Using MMS model with `star` token for batch size > 1

pytorch / audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

https://pytorch.org/audio

BSD 2-Clause "Simplified" License

2.43k stars 635 forks source link

Open huangruizhe opened 2 months ago

huangruizhe commented 2 months ago

However, the underlying Wav2vec model supports batch size greater than one. So this line should instead be:

star_dim = torch.zeros((output.size(0), output.size(1), 1), dtype=output.dtype, device=output.device)