-
Hi, I would like to express my admiration for the excellent work you have presented in your paper. After downloading the repository and attempting to reproduce the results from 'both/large/pytorch_mod…
-
Are the models '630k-audioset-fusion-best.pt' and '630k-audioset-best.pt' trained and evaluated on AudioSet?
If so, how are they trained or evaluated on AudioSet?
Because videos in AudioSet contain …
-
Hey!
I'm trying to reproduce the results for the HTSAT-RoBERTa setup, with AudioCaps and Clotho as the training sets.
Results are good for Clotho-valid, and similar to the paper, but are much worse …
-
Thank you for such wonderful work! I couldn't find the audio captioning models pretrained on WavCaps (CNN14-BART baseline, HTSAT-BART baseline). Could you provide them?
-
Dear Author,
First and foremost, congratulations on your commendable work!
Now, onto the inquiry.
In the directory "blacklist" of the dataset there are 3 json files: "blacklist_exclude_all_ac…
-
I tried to download the Clotho and MACS datasets using `aac-datasets-download --root "." macs` and encountered the error `RuntimeError: Invalid checksum for file MACS.yaml`. After downloading macs.yaml
-
Dear author, I want to pretrain from scratch and reproduce the zero-shot audio classification result. Should I use the 'blacklist_exclude_ub8k_esc50_vggsound.json' as the blacklist file, and use the '…
-
Thank you very much for your outstanding contribution to the open source community. I noticed that your evaluation metric differs from the one used in the paper that proposed the Clotho-AQA dataset, namely Cl…
-
Hi,
I am running the CALP project, but I don't know how to run it in multi-GPU mode on a single node.
Given the script in `train-only-clotho.sh`, can you give some guidance on running CALP project in mu…
-
Thanks a lot for your great contribution, but I'm having some issues reproducing your work. For the metadata part in the code base, does the source data contained inside come from the Clotho-AQA datas…