facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
MIT License
20.18k stars 2.01k forks source link

Many bug fixes for v1.1.0 #335

Closed adefossez closed 8 months ago

adefossez commented 8 months ago

This introduces quite a few bug fixes and some non backward compatible changes. From the changelog:

Fixed DAC support with non default number of codebooks.

Fixed bug when two_step_cfg was overriden when calling generate().

Fixed samples being always prompted with audio, rather than having both prompted and unprompted.

Backward incompatible change: A torch.no_grad around the computation of the conditioning made its way in the public release. The released models were trained without this. Those impact linear layers applied to the output of the T5 or melody conditioners. We removed it, so you might need to retrain models.

Backward incompatible change: Fixing wrong sample rate in CLAP (WARNING if you trained model with CLAP before).

Backward incompatible change: Renamed VALLEPattern to CoarseFirstPattern, as it was wrongly named. Probably no one retrained a model with this pattern, so hopefully this won't impact you!