[MusicGen] No attention mask for conditions?

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

MIT License


[MusicGen] No attention mask for conditions? #385

Open stg1205 opened 5 months ago

stg1205 commented 5 months ago

Why is the cond_mask ignored in the condition fuser? The mask is only used to zero out the positions where the input is None or padding, but attention will still assign weight to these zeroed positions.
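To illustrate the concern, here is a minimal pure-Python sketch of single-query dot-product attention. It is not audiocraft code: `attention_weights`, `key_mask`, and the toy vectors are hypothetical names and values chosen for the example, and only `cond_mask` is taken from the question. It shows that a key zeroed out by padding gets a score of 0, which softmax turns into a nonzero weight, whereas setting the score to negative infinity removes it.

```python
import math


def softmax(scores):
    # Subtract the max for numerical stability; exp(-inf) evaluates to 0.0.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_weights(query, keys, key_mask=None):
    """Scaled dot-product attention weights for one query (hypothetical helper).

    key_mask: optional list of 0/1 flags, 1 = real token, 0 = padding.
    Assumes at least one key is unmasked.
    """
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    if key_mask is not None:
        # Masked positions get -inf so that softmax assigns them zero weight.
        scores = [s if keep else float("-inf") for s, keep in zip(scores, key_mask)]
    return softmax(scores)


# Toy condition sequence: two real embeddings followed by two padded
# positions that were zeroed out (as the mask is described to do above).
query = [1.0, 0.5]
keys = [[0.8, -0.2], [0.1, 0.9], [0.0, 0.0], [0.0, 0.0]]
cond_mask = [1, 1, 0, 0]

unmasked = attention_weights(query, keys)          # zeros only, no mask
masked = attention_weights(query, keys, cond_mask)  # mask applied to scores

print("without attention mask:", [round(w, 3) for w in unmasked])
print("with attention mask:   ", [round(w, 3) for w in masked])
```

In the unmasked case the two padded positions still take a share of the attention weight, so the share left for the real tokens changes with the amount of padding. Whether this matters in practice for MusicGen is exactly what the question asks.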