Open blue-blue272 opened 4 months ago
AudioSet only contains audio and event labels. How do you obtain the caption descriptions for the audio clips in the AudioSet dataset?
Please check this: https://github.com/XinhaoMei/WavCaps. It is mentioned in the paper, though probably not in a very obvious place.
-Yuan