Open siqi0905 opened 1 year ago
Hi, since we have the binary mask for input image, so we know the indices of known/unknown pixels. And yes, for layers with different resolutions (e.g., 64,32,16,8) , I has different sizes, the mask should be resized to match the corresponding resolution.
ๅไปถไบบ: ไธๆ็ช @.> ๅ้ๆถ้ด: 2023ๅนด10ๆ14ๆฅ 19:14 ๆถไปถไบบ: ysy31415/unipaint @.> ๆ้: Subscribed @.***> ไธป้ข: [Ext] [ysy31415/unipaint] About the detail of Masked attention control (Issue #2)
CAUTION: External email. Do not reply, click on links or open attachments unless you recognize the sender and know the content is safe.
In equation (9), how is I obtained? Is the dimension of I different in each attention layer? How are they obtained to generate ๐๐๐ก๐ก๐ [๐, ๐]? Thank you very much for your response๏ผ
โ Reply to this email directly, view it on GitHubhttps://github.com/ysy31415/unipaint/issues/2, or unsubscribehttps://github.com/notifications/unsubscribe-auth/BBVALB3NQXZNKT7F6XCYIODX7JXYHANCNFSM6AAAAAA6AF5KTA. You are receiving this because you are subscribed to this thread.Message ID: @.**@.>>
if the size of the binary mask for the input image is (1, 512, 512), do you mean we just need to resize it to (1, 64, 64), (1, 32, 32), ..., (1, 8, 8)?
Yes, thatโs right
In equation (9), how is I obtained? Is the dimension of I different in each attention layer? How are they obtained to generate ๐๐๐ก๐ก๐ [๐, ๐]? Thank you very much for your response๏ผ