questions about motion mask

google / dynibar

Implementation of DynIBaR: Neural Dynamic Image-Based Rendering (CVPR 2023)

https://dynibar.github.io/

Apache License 2.0

questions about motion mask #4

Open tb2-sy opened 1 year ago

tb2-sy commented 1 year ago

Thanks for your great work! The motion segmentation method you proposed is very novel. I would like to ask whether the earlier Mask R-CNN + epipolar distance approach used in Dynamic NeRF would fail to produce usable motion masks under the long videos and complex camera trajectories considered in this work?

zhengqili commented 1 year ago

Hi, I think Mask R-CNN + epipolar distance could work for long videos if the estimated optical flow is accurate enough. However, it could fail if the estimated flow is not accurate (somehow this issue always happens for the dynamic scene datasets).

Indeed, we recently found that for in-the-wild monocular videos, initializing our motion segmentation module with those masks can provide more robust estimates, especially for degenerate cases such as co-linear camera-object motion, where photometric inconsistency might not be sufficient to explain away the motion.
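For readers unfamiliar with the epipolar distance cue discussed here, the following is a minimal sketch of how it can be computed from optical flow. It is not code from the DynIBaR or Dynamic NeRF repositories. The function name `epipolar_distance`, the flow layout `(H, W, 2)`, and the convention that the fundamental matrix `F` satisfies `x2^T F x1 = 0` are all assumptions made for illustration.

```python
import numpy as np

def epipolar_distance(flow, F):
    """Distance (in pixels) from each flow-displaced point to its epipolar line.

    flow: (H, W, 2) array of forward flow (dx, dy) from frame 1 to frame 2 (assumed layout).
    F:    (3, 3) fundamental matrix with x2^T F x1 = 0 (assumed convention).
    """
    h, w = flow.shape[:2]
    xs, ys = np.meshgrid(np.arange(w, dtype=np.float64),
                         np.arange(h, dtype=np.float64))
    ones = np.ones_like(xs)
    # Homogeneous pixel coordinates in frame 1 and their flow targets in frame 2.
    p1 = np.stack([xs, ys, ones], axis=-1)
    p2 = np.stack([xs + flow[..., 0], ys + flow[..., 1], ones], axis=-1)
    # Epipolar line in frame 2 for every pixel of frame 1: l = F @ p1.
    lines = p1 @ F.T
    # Point-to-line distance |l . p2| / sqrt(a^2 + b^2).
    num = np.abs(np.sum(lines * p2, axis=-1))
    den = np.sqrt(lines[..., 0] ** 2 + lines[..., 1] ** 2) + 1e-12
    return num / den
```

Pixels whose flow obeys the camera geometry get a distance near zero, and thresholding the result gives a geometric motion mask. Two caveats follow directly from the formula: inaccurate flow inflates the distance everywhere, and an object that moves along its own epipolar line keeps a distance of zero, so this cue alone cannot detect it.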

tb2-sy commented 1 year ago

Thanks for your reply. I am now experimenting on a scene with dynamic elements (Family) from the Tanks and Temples dataset. What confuses me is that the resulting motion mask is completely white, that is, the whole image is treated as dynamic, even though only a few elements in it are actually moving.

zhengqili commented 1 year ago

Are you referring to the method in DynIBaR or to "Mask R-CNN + epipolar distance"?

tb2-sy commented 1 year ago

I am referring to the "Mask R-CNN + epipolar distance" method, which is the one that failed. I am not sure whether this is a limitation of the method itself or a mistake on my side. Thank you.

zhengqili commented 1 year ago

You should check whether all the moving regions are covered by either the Mask R-CNN part or the epipolar thresholding part. In my experience, this method should not fail completely on the Tanks and Temples dataset.
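One way to run that check is to measure how much of the frame each part flags on its own. The sketch below is only an illustration. The helper name `mask_coverage`, its inputs, and the choice to combine the two parts with a union are assumptions, not something confirmed in this thread.

```python
import numpy as np

def mask_coverage(semantic_mask, epi_dist, threshold):
    """Fraction of pixels flagged as dynamic by each part of the pipeline.

    semantic_mask: (H, W) array, nonzero where the segmentation network marks
                   a potentially moving object (hypothetical input).
    epi_dist:      (H, W) array of per-pixel epipolar distances in pixels.
    threshold:     distance above which a pixel counts as moving.
    """
    semantic = semantic_mask.astype(bool)
    geometric = epi_dist > threshold
    # Assumption: the final mask is the union of the two cues.
    combined = semantic | geometric
    return {
        "semantic": float(semantic.mean()),
        "epipolar": float(geometric.mean()),
        "combined": float(combined.mean()),
    }
```

If the `epipolar` fraction is close to 1.0 while the `semantic` fraction is small, the all-white mask most likely comes from the thresholding step, for example from a threshold that is too low or from inaccurate flow or camera poses, rather than from the segmentation network.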

tb2-sy commented 1 year ago

Thanks, I got it!