git clone https://github.com/XinhaoMei/DCASE2021_task6_v2.git
conda env create -f environment.yml -n name
If you see "OSError: sndfile library not found", please try:
conda install -c conda-forge libsndfile
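Before preparing data, it can help to confirm that the audio backend loads at all. The following is a minimal sketch, assuming the error above comes from the soundfile Python package (which raises OSError when it cannot locate libsndfile). The function name check_soundfile and its status strings are illustrative and not part of this repository.

```python
import importlib


def check_soundfile() -> str:
    """Report whether the soundfile package and its native library load.

    Returns one of: "ok", "missing-package", "missing-libsndfile".
    The status strings are hypothetical labels chosen for this sketch.
    """
    try:
        importlib.import_module("soundfile")
    except ImportError:
        # The Python package itself is not installed in this environment.
        return "missing-package"
    except OSError:
        # Assumption: soundfile raises OSError when libsndfile is not found.
        return "missing-libsndfile"
    return "ok"


if __name__ == "__main__":
    status = check_soundfile()
    print(f"soundfile status: {status}")
    if status == "missing-libsndfile":
        print("Try: conda install -c conda-forge libsndfile")
```

If this reports "missing-libsndfile", the conda command above is the suggested fix.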
Run data_prep.py to prepare the h5py files.
Run coco_caption/get_stanford_models.sh to download the libraries necessary for evaluating the metrics.
Set the training parameters in settings/settings.yaml, then run:
python train.py -n exp_name
For reinforcement learning fine-tuning, set the parameters in the rl block in settings/settings.yaml, then run:
python finetune_rl.py -n exp_name
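The steps above can be collected into one ordered list of commands. This is a rough sketch only: the script names and the -n flag come from this README, while the helper names build_pipeline and run_pipeline, the use of bash to invoke the shell script, and running everything from the repository root are assumptions. It defaults to a dry run that only prints the commands.

```python
import subprocess
from typing import List


def build_pipeline(exp_name: str, with_rl: bool = True) -> List[List[str]]:
    """Return the README's commands in order (hypothetical helper)."""
    steps = [
        # Assumption: data_prep.py is invoked without arguments.
        ["python", "data_prep.py"],
        # Assumption: the script is run with bash from the repository root.
        ["bash", "coco_caption/get_stanford_models.sh"],
        ["python", "train.py", "-n", exp_name],
    ]
    if with_rl:
        # Optional reinforcement learning fine-tuning stage.
        steps.append(["python", "finetune_rl.py", "-n", exp_name])
    return steps


def run_pipeline(steps: List[List[str]], dry_run: bool = True) -> None:
    """Print each command; execute it only when dry_run is False."""
    for cmd in steps:
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops the pipeline at the first failing step.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run_pipeline(build_pipeline("exp_name"))
```

Keeping the dry run as the default lets you review the command order before starting a long training job; pass dry_run=False to actually execute the steps.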
For more details, please refer to our technical reports (pdf, 2022; pdf, 2021) and our paper (pdf).
If you use our code, please cite the following:
@inproceedings{Mei2021,
author = "Mei, Xinhao and Huang, Qiushi and Liu, Xubo and Chen, Gengyun and Wu, Jingqian and Wu, Yusong and ZHAO, Jinzheng and Li, Shengchen and Ko, Tom and Tang, H. and Shao, Xi and Plumbley, Mark D. and Wang, Wenwu",
title = "An Encoder-Decoder Based Audio Captioning System with Transfer and Reinforcement Learning",
booktitle = "Proceedings of the 6th Detection and Classification of Acoustic Scenes and Events 2021 Workshop (DCASE2021)",
address = "Barcelona, Spain",
month = "November",
year = "2021",
pages = "206--210",
isbn = "978-84-09-36072-7",
doi = "10.5281/zenodo.5770113"
}