Is "caption" meaningful for the model?

sjtuplayer / anomalydiffusion

[AAAI 2024] AnomalyDiffusion: Few-Shot Anomaly Image Generation with Diffusion Model

MIT License

114 stars 14 forks source link

Hi there,

I've noticed that within the inputs of the model, the "caption" is randomly selected from the strings in ldm/data/personalized.py => imagenet_templates_small.

However, given that this selection process is random, I'm curious about the significance of this caption for the model. Since both training and inference processes utilize this caption as one of the inputs, it's likely to have an impact on the model's output. Therefore, I find it a bit puzzling and would like to confirm with you, if I may be misunderstanding something.

sjtuplayer / anomalydiffusion

Is "caption" meaningful for the model? #30