Closed JunHeum closed 3 years ago
Your code looks alright. FLAVR is designed (and trained) to take multiple frames from video as input and render the output video with higher frame rate. As such, duplicating input frames is suboptimal as it does not reflect the true motion characteristics in the video, hence the performance with duplicating triplets will be low.
I understand why the suboptimal results appear when duplicating input frames.
Since FLAVR takes 4 input frames, its performance seems to get a large gain compared to other methods.
Isn't it unfair compared to other video interpolation methods that use 2 frames as input?
In the paper, we also show comparisons with QVI which takes 4 input frames.
Multi-frame interpolation algorithms (SuperSlomo, QVI, EQVI, FLAVR) and single-frame interpolation algorithms (DAIN, CAIN, SoftSplat) are usually compared on different benchmarks. FLAVR has made some efforts to unify these two types of algorithms.
@hzwer Yes, that's is right. we indeed make an attempt to unify benchmarks across all the prior works which work in very different settings.
Hi,
I am impressed with your new video frame interpolation paper.
When I tested, I got 32.59dB in vimeo90K triplet test set.
According to your Middleburry.py in dataset directory, I fixed VimeoSepTuplet class to VimeoTriplet class like below.
What is the problem in my fixed code?
I am wondering if I could get custom triplet interpolation code which takes two input frames and yields an intermediate frame.