facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
MIT License

How to fuse several `WaveformConditioner`s #460

Open javanasse opened 1 month ago

javanasse commented 1 month ago

Conditioners are defined in YAML files under `config/conditioner`, for example `chroma2music.yaml`. In that file, the chroma model's parameters are set under the `self_wav:` key.

I have defined another conditioner and would like to train MusicGen conditioned on two `WaveformConditioner`s: the `ChromaStemConditioner` and the new conditioner I defined. What is the best way to do that?

Is there a syntax for doing this in the conditioner YAML, or is it better to create a new `WaveformConditioner` that simply returns a stacked tensor formed from both conditions?
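To make the second option concrete, here is a minimal sketch of what I have in mind, written in plain PyTorch rather than against the audiocraft classes. Everything in it is an assumption for illustration: `ToyWaveformConditioner` is a hypothetical stand-in for a real conditioner such as `ChromaStemConditioner`, `FusedWaveformConditioner` is a hypothetical wrapper name, and I am assuming each conditioner returns an `(embedding [B, T, D], mask [B, T])` pair already projected to a shared output dimension. The two conditions are concatenated along the time axis here. Stacking along the feature axis would instead require both conditions to have the same number of frames.

```python
from typing import Tuple

import torch
from torch import nn

# Assumed condition format: (embedding [B, T, D], mask [B, T]).
Condition = Tuple[torch.Tensor, torch.Tensor]


class ToyWaveformConditioner(nn.Module):
    """Hypothetical stand-in for a waveform conditioner (not an audiocraft class)."""

    def __init__(self, frame_size: int, output_dim: int):
        super().__init__()
        self.frame_size = frame_size
        self.output_proj = nn.Linear(frame_size, output_dim)

    def forward(self, wav: torch.Tensor) -> Condition:
        # wav: [B, T_samples] -> non-overlapping frames [B, T_frames, frame_size]
        frames = wav.unfold(1, self.frame_size, self.frame_size)
        emb = self.output_proj(frames)
        # No padding in this toy example, so every frame is valid.
        mask = torch.ones(emb.shape[:2], dtype=torch.bool, device=emb.device)
        return emb, mask


class FusedWaveformConditioner(nn.Module):
    """Hypothetical wrapper that joins two conditions along the time axis."""

    def __init__(self, first: nn.Module, second: nn.Module):
        super().__init__()
        self.first = first
        self.second = second

    def forward(self, wav_a: torch.Tensor, wav_b: torch.Tensor) -> Condition:
        emb_a, mask_a = self.first(wav_a)
        emb_b, mask_b = self.second(wav_b)
        if emb_a.shape[-1] != emb_b.shape[-1]:
            raise ValueError("both conditions must share the same output dimension")
        # Concatenate embeddings and masks along time: [B, T_a + T_b, D] and [B, T_a + T_b].
        emb = torch.cat([emb_a, emb_b], dim=1)
        mask = torch.cat([mask_a, mask_b], dim=1)
        return emb, mask


# Usage: two toy conditioners with different frame rates, same output dimension.
fused = FusedWaveformConditioner(
    ToyWaveformConditioner(frame_size=160, output_dim=32),
    ToyWaveformConditioner(frame_size=320, output_dim=32),
)
wav = torch.randn(2, 3200)
emb, mask = fused(wav, wav)
```

With this approach the rest of the model would see a single condition, so the YAML would only need one entry for the fused conditioner. The downside is that the two conditions could no longer be dropped out or routed independently.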