Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
MIT License
Can AudioGen take both text and audio as input? #320
Is it possible for AudioGen to take both text and audio as input and generate audio conditioned on both?
For example, given a recording of a pump in normal operation and the text "cavitation", could AudioGen generate an abnormal recording that contains both the normal pump sound and cavitation sounds? Any hints are appreciated.
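One possible starting point, offered as a sketch and not a confirmed answer: audiocraft's `AudioGen` exposes `generate_continuation`, which accepts an audio prompt together with text descriptions. Note that this continues the prompt in time under the text condition; it does not overlay a new sound on top of the existing recording, so it may not produce the mixed "normal plus cavitation" signal described above. The helper `prompt_num_samples`, the function `continue_with_text`, the file path argument, and the default durations are hypothetical names and values chosen for illustration. Whether the pretrained model produces realistic cavitation sounds is not established here.

```python
def prompt_num_samples(total_samples: int, sample_rate: int, max_seconds: float) -> int:
    """Return how many trailing samples to keep as the audio prompt (hypothetical helper)."""
    if sample_rate <= 0 or max_seconds <= 0:
        raise ValueError("sample_rate and max_seconds must be positive")
    return min(total_samples, int(round(sample_rate * max_seconds)))


def continue_with_text(wav_path: str, description: str,
                       prompt_seconds: float = 2.0, total_seconds: float = 5.0):
    """Continue an audio prompt with AudioGen, conditioned on a text description.

    Assumption: the output is a temporal continuation of the prompt,
    not a mix of the prompt with the described sound.
    """
    if prompt_seconds >= total_seconds:
        raise ValueError("the prompt must be shorter than the total generated duration")

    # Imported lazily so the helper above works without audiocraft installed.
    import torchaudio
    from audiocraft.models import AudioGen

    model = AudioGen.get_pretrained("facebook/audiogen-medium")
    # The duration covers the prompt plus the newly generated audio.
    model.set_generation_params(duration=total_seconds)

    wav, sr = torchaudio.load(wav_path)  # wav has shape [channels, samples]
    n = prompt_num_samples(wav.shape[-1], sr, prompt_seconds)
    prompt = wav[..., -n:]  # keep only the last n samples as the prompt

    # generate_continuation resamples the prompt to the model's sample rate.
    out = model.generate_continuation(
        prompt, prompt_sample_rate=sr, descriptions=[description], progress=True
    )
    return out[0].cpu(), model.sample_rate
```

A call such as `continue_with_text("pump_normal.wav", "cavitation")` (with a hypothetical file name) would return a waveform tensor and its sample rate. To obtain audio that contains both sounds at once, one would likely still need to mix a generated clip with the original recording in a separate step.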