I realized that the file audioset_unbalanced_train.json required for training the audio model is missing. Could you please share this file or let me know how to acquire this.
To make your own copy, you may download the videos from the list of audioset, and then trim the video and audio according to the annotated timestamps. To enrich the captions, you may refer to CLAP for their used captions.
Hi,
Thanks for releasing this great project!
I realized that the file
audioset_unbalanced_train.json
required for training the audio model is missing. Could you please share this file or let me know how to acquire this.