Thanks for opening codes. However, I notice that some codes in VideoCrafter/lvdm/modules/attention.py have a few mistake.
First, in Line 94 and Line 110, the relative position seems wrong.
Then, in Line 123, I believe that "out_ip = rearrange(out, '(b h) n d -> b n (h d)', h=h" should be "out_ip = rearrange(out_ip, '(b h) n d -> b n (h d)', h=h".
Please authors check the code and fix them soon if they are bugs. Otherwise, I hope authors could explain this code.
Thanks!
Thanks for opening codes. However, I notice that some codes in VideoCrafter/lvdm/modules/attention.py have a few mistake. First, in Line 94 and Line 110, the relative position seems wrong. Then, in Line 123, I believe that "out_ip = rearrange(out, '(b h) n d -> b n (h d)', h=h" should be "out_ip = rearrange(out_ip, '(b h) n d -> b n (h d)', h=h".
Please authors check the code and fix them soon if they are bugs. Otherwise, I hope authors could explain this code. Thanks!