castorini / daam

Diffusion attentive attribution maps for interpreting Stable Diffusion.
MIT License
669 stars 61 forks source link

about image size and prompt length #43

Closed mylee95 closed 1 year ago

mylee95 commented 1 year ago

Hi, firstly thank you for sharing your great work.

While reading your paper and code, I had question about image size and prompt length. It seems like your code only accepts image size of 512x512 or 768x768, and with maximum prompt length of 75+2, which is limit of text encoder. However, if my understanding is correct, there is no problem with getting attribution maps while making any size of image with longer prompt.

Is it correct? Or is there any other reason that daam is coded with such a limitation?

daemon commented 1 year ago

In theory, there's no limit to context length or image size. My understanding is that the models themselves have those aforementioned constraints.

mylee95 commented 1 year ago

@daemon thank you for quick reply! I'll try changing code to make sure any image size or context length can be accepted.