Taited / clip-score

Quick scripts to calculate CLIP text-image similarity
Apache License 2.0
182 stars, 15 forks

CLIP-T, CLIP-I implementation #3

Open Sundragon1993 opened 2 months ago

Sundragon1993 commented 2 months ago

Dear author,

Thanks so much for the great contribution to the community. Recent SD benchmark papers often report subject fidelity using CLIP-I and DINO, and prompt fidelity using either CLIP-T or CoCa. Do you have any plans to implement these metrics? Even just some insights would be valuable. Once again, thank you!

Taited commented 2 months ago

Hi,

Thank you for your appreciation of this repository!

Recently, I've received several requests from the community. Unfortunately, I'm quite busy at the moment, but I hope to make these improvements within the next month.

If you'd like to implement this yourself, please check this line. You can replace the model with your desired encoder; also pay attention to the function call at this line.

Also, pull requests are always welcome!
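
For readers following the suggestion above, here is a minimal sketch of how the requested metrics are commonly defined. It is not this repository's code. CLIP-I is usually taken as the average pairwise cosine similarity between embeddings of generated and reference images, and CLIP-T as the average cosine similarity between each generated image and its prompt. The names `encode_image` and `encode_text` are hypothetical placeholders for whatever encoder you plug in (for example CLIP's image and text towers); passing a DINO image encoder to `clip_i` would give a DINO-style score.

```python
import math
from typing import Any, Callable, Sequence

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine similarity between two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cannot compare a zero-norm embedding")
    return dot / (norm_a * norm_b)


def clip_i(
    generated_images: Sequence[Any],
    reference_images: Sequence[Any],
    encode_image: Callable[[Any], Vector],  # hypothetical: your image encoder
) -> float:
    """Image-image score: mean cosine similarity over all
    (generated, reference) pairs."""
    gen = [encode_image(img) for img in generated_images]
    ref = [encode_image(img) for img in reference_images]
    if not gen or not ref:
        raise ValueError("need at least one generated and one reference image")
    total = sum(cosine_similarity(g, r) for g in gen for r in ref)
    return total / (len(gen) * len(ref))


def clip_t(
    generated_images: Sequence[Any],
    prompts: Sequence[str],
    encode_image: Callable[[Any], Vector],  # hypothetical: your image encoder
    encode_text: Callable[[str], Vector],   # hypothetical: your text encoder
) -> float:
    """Text-image score: mean cosine similarity between each generated
    image and the prompt it was generated from (paired by position)."""
    if not prompts or len(generated_images) != len(prompts):
        raise ValueError("need one prompt per generated image")
    sims = [
        cosine_similarity(encode_image(img), encode_text(txt))
        for img, txt in zip(generated_images, prompts)
    ]
    return sum(sims) / len(sims)
```

The encoders are passed in as callables so the same two functions work with any model. Only the embedding step changes, which matches the advice above about replacing the model and checking how it is called.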