Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
How would one collect and train a neural net to predict and/or generate spatial audio trajectories from environment descriptions via llm in conjunction with this current model.
How would one collect and train a neural net to predict and/or generate spatial audio trajectories from environment descriptions via llm in conjunction with this current model.