Few questions about the paper

jaeyeonkim99 / EnCLAP

Official Implementation of EnCLAP (ICASSP 2024)

MIT License

Hello,

Thank you very much for this great work. I have a few questions about the paper and code:

1. Have you tried training with WavCaps or another larger dataset? From the WavCaps paper, it seems that using more data significantly improves the results.
2. Do the results reported in the paper come from the checkpoint with the highest validation score?
3. From the ablations, it seems that MCM does not contribute much to the results (CIDEr drops by only 0.02 points). Have you performed any ablations on the AudioCaps dataset, especially with regard to the main components (MCM and CLAP)?

Thank you very much for your help.

Few questions about the paper #9