PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"
Apache License 2.0
30
stars
1
forks
source link
could you add the training script into the repo? #2
Hi, could you add the training script into the repo?