sp-uhh / storm

StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
MIT License

Attention layer used in bottleneck #3

Closed: philgzl closed this issue 1 year ago

philgzl commented 1 year ago

In the paper, when discussing the network architecture and comparing NCSN++M with NCSN++, it is said that "the attention layer in the bottleneck is removed". However, it seems to me that setting attn_resolutions = (0,) here means no attention layers are used anywhere except the one in the bottleneck here. This is also the case in the previous sp-uhh/sgmse repo (here and here). So it seems to me the correct statement describing the new model would be "the attention layers are removed except in the bottleneck layer".
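For context, here is a minimal sketch of the gating pattern being described; the class and block names are illustrative placeholders, not the actual NCSN++ code. Attention blocks in the down/up path are only attached at resolutions listed in attn_resolutions, so attn_resolutions = (0,) matches no real resolution and disables them all, while the bottleneck unconditionally keeps its attention block:

```python
import torch
import torch.nn as nn


class ResBlock(nn.Module):
    """Minimal residual conv block (placeholder for the real one)."""
    def __init__(self, channels):
        super().__init__()
        self.conv = nn.Conv2d(channels, channels, 3, padding=1)

    def forward(self, x):
        return x + self.conv(x)


class AttnBlock(nn.Module):
    """Minimal self-attention over spatial positions (placeholder)."""
    def __init__(self, channels):
        super().__init__()
        self.attn = nn.MultiheadAttention(channels, num_heads=1, batch_first=True)

    def forward(self, x):
        b, c, h, w = x.shape
        seq = x.flatten(2).transpose(1, 2)  # (B, H*W, C)
        out, _ = self.attn(seq, seq, seq)
        return x + out.transpose(1, 2).reshape(b, c, h, w)


class TinyScoreNet(nn.Module):
    """Toy stand-in for the NCSN++-style backbone discussed above."""
    def __init__(self, channels=8, resolutions=(64, 32, 16), attn_resolutions=(0,)):
        super().__init__()
        blocks = []
        for res in resolutions:
            blocks.append(ResBlock(channels))
            # Attention is only attached at resolutions explicitly listed,
            # so attn_resolutions=(0,) matches none of them.
            if res in attn_resolutions:
                blocks.append(AttnBlock(channels))
        self.down = nn.Sequential(*blocks)
        # The bottleneck always contains an attention block, regardless of
        # attn_resolutions -- this is the layer the issue points out.
        self.bottleneck = nn.Sequential(
            ResBlock(channels), AttnBlock(channels), ResBlock(channels))

    def forward(self, x):
        return self.bottleneck(self.down(x))


net = TinyScoreNet()
y = net(torch.randn(1, 8, 16, 16))  # attention runs only in the bottleneck
```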

jmlemercier commented 1 year ago

Hi! You are absolutely right, we left the attention block in the bottleneck. That mistake comes from confusion between multiple experiments, in some of which we did indeed remove the bottleneck attention. We will correct this if the paper gets accepted. Thanks for reporting!