Release of AnimalSpeak at HuggingFace

david-rx / BioLingual

Contrastive language-audio pretraining for bioacoustics

Apache License 2.0

16 stars 0 forks source link

Hi Julian, thanks for the message. The main differences between the released set and the full set described in the paper:

The released csv doesn't contain AudioCaps. It's easier to get elsewhere e.g. with a library like audiocaps-download
Several held-out sets are already removed from the released version. This includes an AnimalSpeak test set, a small eval set, as well as the eval and test sets from Watkins and CBI in the BEANS benchmark. I'll try to add at least the test set used for large-scale species prediction soon. Also, some Xeno-canto recordings were deduplicated before the release and training if they had extremely frequent captions.

After adding AudioCaps and processing, this should be the right set to approximately recreate the training. I don't have a good script to cleanly download and process these ready yet, but this is something I hope to share soon!

david-rx / BioLingual

Release of AnimalSpeak at HuggingFace #2