About reproduce the result in 'Understanding Deformable Alignment in Video Super-Resolution'

XPixelGroup / BasicSR

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.

Apache License 2.0

6.92k stars 1.21k forks source link

Hi, thanks for your great work!

I'm trying to visualize the result presented in paper 'Understanding Deformable Alignment in Video Super-Resolution'. According to the paper, I expect the offset will be similar to optical flow. However, I visualized the offset in Layer 1, the number is much smaller than optical flow.

Here is what I did: I used the pretrained EDVR model 'EDVR_L_x4_SR_Vimeo90K_official-162b54e4.pth' and visualized the L1 offset right before the cascading part. The offset tensor has shape (1, 128, h, w) and I reshaped it to (1, 2, 8=group_number, 8=kernal_h*kernal_w, h, w), and randomly pick one (2, h, w) to visualize.

By my observation, I didn't see a big difference between L1 offset and the cascading offset mentioned in https://github.com/XPixelGroup/BasicSR/issues/404.

Can you help me with that? thanks

XPixelGroup / BasicSR

About reproduce the result in 'Understanding Deformable Alignment in Video Super-Resolution' #552