confusing things in CLAPScore evaluation

Text-to-Audio / Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

MIT License

750 stars 110 forks source link

Open PeiwenSun2000 opened 2 months ago

PeiwenSun2000 commented 2 months ago

I have try to use the CLAPScore to eval my own model after generation. however, I find two confusing things.

CLAPScore is different, even if I maintain the generated audio unchanged. Where did the random factor come from? How can I get a fixed test result.
My CLAPScore is higher (better) than the GROUND TRUTH. Although I use more data to train this model, is it possible?