slSeanWU / beats-conformer-bart-audio-captioner

PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"
Apache License 2.0
30 stars, 1 fork

How to reproduce the model training process? #1

Open keepcreative opened 6 months ago

keepcreative commented 6 months ago

Hi Shih-Lun Wu, I'm trying to reproduce the model training process described in the paper, but I found that the README only explains how to test the model.

keepcreative commented 6 months ago

I'm begging for a more detailed README file that shows how to train the model.

slSeanWU commented 5 months ago

Hi @keepcreative, thanks for reaching out.

Could you please refer to Section 3.1 of our paper (https://arxiv.org/pdf/2309.17352) for the training settings? Thanks.

-- Shih-Lun