Optical flow estimation between two images

Thank you for your interest in our work!

Small objects subject to motion are, as far as I know, an unsolved problem of optical flow estimation. Specifically, all optical flow methods that I am aware of do the estimation at a lower scale (or at a really low scale with coarse-to-fine flow estimation) where small/thin objects essentially are invisible. Imagine you have an object of size 32 pixels and use PWC-Net to estimate the flow, PWC-Net starts by downscaling the images 6 times before performing the coarse-to-fine flow estimation. As such, the object of size 32 pixels is now 32/2^6==0.5 pixels and hence essentially invisible.

RIFE downscales the image 3 times, so the problem is less pronounced but it is still there. However, you might be hitting another problem, the official models were not trained on high-resolution footage. You might not see good results because the motion in your samples may be greater (in terms of pixels) than what the optical flow network was trained on. For example, consider training an optical flow method on FlyingThings3D which has a resolution of 960x540 and an average motion magnitude of X pixels. If you used this trained optical flow estimator on a 4K version of FlyingThings3D, which has an average motion magnitude of 4*X pixels, you will probably get poor results.

For high-resolution footage, I am under the impression that your best bet is doing iterative flow upsampling. For more information, see Section 5 of "Splatting-based Synthesis for Video Frame Interpolation". It will still struggle with thin objects though.

sniklaus / softmax-splatting

Optical flow estimation between two images #53