pytorch / audio

Data manipulation and transformation for audio signal processing, powered by PyTorch
https://pytorch.org/audio
BSD 2-Clause "Simplified" License
2.54k stars 654 forks source link

[v2.0.1] Release Tracker #3237

Closed nateanl closed 1 year ago

nateanl commented 1 year ago

This issue is for tracking TODO lists which are supposed to be cherry-picked to the release branch. Following is release branch for the 2.0.1 release.

mthrok commented 1 year ago

https://github.com/pytorch/audio/pull/3230 resolves https://github.com/pytorch/audio/issues/3218, which existed since earlier release


mthrok commented 1 year ago

https://github.com/pytorch/audio/pull/3204 fixes issue with multiple formats, which existed since the initial release of StreamReader.


mthrok commented 1 year ago

https://github.com/pytorch/audio/pull/3243 and https://github.com/pytorch/audio/pull/3248 fixes issue with FFmpeg backend which was introduced in 2.0


nateanl commented 1 year ago

3252 and #3265 fix WavLM model when it is called in torch.no_grad().


nateanl commented 1 year ago

3253 improves Wav2Vec2/HuBERT by using scaled_dot_product_attention to increase efficiency.