Open blue-blue272 opened 1 month ago
The training set contains AudioCaps, but how did you obtain the captions for AudioCaps?
Hi @blue-blue272, AudioCaps has human-annotated captions released with the data. We use the same captions.