Labbeti / conette-audio-captioning

CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding
https://arxiv.org/pdf/2309.00454.pdf
11 stars 0 forks source link