facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
MIT License
20.23k stars 2.03k forks source link

The input of AudioGen includes both text and audio? #320

Open jmren168 opened 9 months ago

jmren168 commented 9 months ago

Hi,

Is it possible the input of AudioGen includes both text and audio, then AudioGen geneartes an audio according to them? For example, given a pump normal operation audio and text "cavitation", AudioGen can generate an abnormal audio containing both normal and cavitation sounds. Any hints are appreciated.