XiangZ-0 / EVDI

Implementation of CVPR'22 paper "Unifying Motion Deblurring and Frame Interpolation with Events"
67 stars 5 forks source link

Questions about the number of output imgae #2

Open HaoDot opened 2 years ago

HaoDot commented 2 years ago

Hi , a nice work for event-based video unfolding! You have considered a practical setting, where the duty-cycle is not 1. However, there are still a few questions for me.

XiangZ-0 commented 2 years ago

Hi Ginobili-20,

Thanks for your interest in our work.

  1. For the number of output images, our EVDI model only outputs one latent frame at the timestamp of interest per inference. In the training stage, we will generate N latent frames per input (you can consider it as running the model for N times) for once loss backpropagation. Please see Train. py for detail.
  2. Your understanding is right :-) In our work, the tasks of deblurring and interpolation can be unified as discussed in Section 3. We separate the two tasks in the experiment section just to facilitate comparison with previous works.
HaoDot commented 2 years ago

Sorry for not replying immediately. Thx for your explanation! However, there is another remaining question. Hope you can reply to it again, thx. There are three loss functions in EVDI, Blurry-event loss, Blurry-sharp loss, and Sharp-event loss, respectively. Besides, from Table 3, it seems that Blurry-sharp loss plays an important role in supervision, others even can't help the model converge. For now, what is mentioned above is all from the original paper. But the fact is that Blurry-sharp loss can only play its role in the deblurring task, which means recovering the latent frame during the exposure time. So, B-S loss can not help the model to converge during the interpolation task, which is designed to recover the latent frames, not in exposure time. What's more, B-E loss will have a trivial solution, when 1 is equal to 2, and 3 is equal to 4 . And, S-E loss will be influenced by the noise in event streams. As shown in Table 3, all losses mentioned above can't work well. So, chances are that the model can not converge well. However, EVDI still has a strong performance in the interpolation task. I can't understand the fact that finetuning with B-E loss and S-E loss can achieve such a good result. I wonder if there are other training strategies that I have missed. To sum up, the existing problem for me is how to supervise EVDI in the interpolation task. Waiting for your reply! Thx. P.S. EVDI is still a brilliant work, which makes an impression on me!

XiangZ-0 commented 2 years ago

Thanks for your question.

Admittedly, EVDI is not perfect and there are some limitations in it such as the noise issue in S-E loss, but we hope EVDI could inspire more exciting works in the related field. Thanks.

HaoDot commented 2 years ago

Thanks for replying in detail again.