Closed by mstadelmann 3 months ago
(In case that is relevant, this happens in 2D.)
`add_attention` only controls the attention layer in the bottleneck (called the middle block in the code). That's something we inherited from Hugging Face that I haven't gotten around to changing yet; and it clearly doesn't seem to work, as I've slimmed the attention blocks drastically in terms of parameters 😅
For an example of attention in up/down blocks, have a look at the tests. It's done by using `AttnDownBlock` / `AttnUpBlock` in the `down_block_types` and `up_block_types`, respectively.
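
To make the relationship concrete, here is a rough, library-independent sketch of how the three settings interact as described above. It is not the library's actual code: `attention_locations` is a hypothetical helper, and `DownBlock` / `UpBlock` are assumed names for the non-attention blocks (only `AttnDownBlock`, `AttnUpBlock`, `down_block_types`, `up_block_types` and `add_attention` are mentioned in this thread).

```python
# Sketch only: a plain-Python summary of the configuration described above.
# "DownBlock" and "UpBlock" are ASSUMED names for blocks without attention;
# attention_locations is a hypothetical helper, not part of any library.

def attention_locations(down_block_types, up_block_types, add_attention):
    """Return the places where attention would be used, per this thread."""
    locations = []
    for i, name in enumerate(down_block_types):
        # Attention in the encoder comes from choosing an Attn* block type.
        if name.startswith("Attn"):
            locations.append(f"down[{i}]")
    if add_attention:
        # add_attention only affects the bottleneck (middle block).
        locations.append("mid")
    for i, name in enumerate(up_block_types):
        # Attention in the decoder likewise comes from the block type.
        if name.startswith("Attn"):
            locations.append(f"up[{i}]")
    return locations


config = {
    "down_block_types": ("DownBlock", "AttnDownBlock", "AttnDownBlock"),
    "up_block_types": ("AttnUpBlock", "AttnUpBlock", "UpBlock"),
    "add_attention": False,  # leave the bottleneck attention off
}

print(attention_locations(**config))
# ['down[1]', 'down[2]', 'up[0]', 'up[1]']
```

In other words, the block-type tuples decide where attention appears in the down and up paths, while `add_attention` is an independent switch for the middle block only. The tests mentioned above remain the authoritative reference for the real constructor arguments.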
OK, thanks!
How do I have to set up the attention blocks? What does the parameter `add_attention` do, and how does it link to the use of AttnUp/Down blocks? If I set it to true, I get