Open DthdZK opened 2 weeks ago
Hi, I would like to know how to evaluate various metrics if I want to use a generative model like sd1.5 should I use color_val.txt to generate 3000 images and then use bash BLIPvqa_eval/test.sh to get a score that is Attribute color? And then the test Attribute Shape has to be generated using Shape_val.txt? I mean when I want to reproduce the corresponding metrics, I should use the corresponding val.txt to generate the test image, right?
Yes, you are correct. To evaluate various metrics for a generative models, use the corresponding val.txt files to generate the test images for each category.
Thank you. I get it!
To use this benchmark to evaluate other models, such as SDXL and SD3-medium, follow these steps:
I hope this helps! Let me know if you need further assistance.