Great work! But I have a few questions:

From the point of view of the $Loss$ function, the model's results in the mask region essentially depend on the output of the 2D inpainting model, since the mask region is optimized against $M'_i$, which is only used in the $\lambda_1 \cdot M_0$ term. Is that right?
Since the $Loss$ function only optimizes the mask region against the inpainted image (the full image is used to optimize the rendering quality of this view), how do you ensure the quality of the mask region in other views?
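To make my question concrete, here is a minimal sketch of how I read the per-view objective (variable names, shapes, and weights are my own assumptions, not from the released code):

```python
import torch

def view_loss(render, inpainted, mask, lam1=1.0, lam2=1.0):
    """How I read the per-view loss (hypothetical names, not the paper's code).

    render:    rendered RGB of this view, shape (3, H, W)
    inpainted: 2D-inpainted target M'_i for this view, shape (3, H, W)
    mask:      removal mask M_0, shape (1, H, W), 1 inside the hole
    """
    # Inside the mask, the render is pulled toward the 2D inpainting result,
    # so the masked region's content is determined by the inpainting model.
    masked_term = (mask * (render - inpainted).abs()).mean()
    # Outside the mask, the render is supervised by the unchanged observation.
    unmasked_term = ((1 - mask) * (render - inpainted).abs()).mean()
    return lam1 * masked_term + lam2 * unmasked_term
```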
Hi, @chenj02. Thank you for your interest in our GScream!
Essentially, that's true. For the masked area, the bidirectional cross-attention helps achieve smoother boundaries, but the reference view's RGBD provides the most important guidance.
Considering 3D constraints, our bidirectional cross-attention module applies regularization to the masked area. From the perspective of 2D supervision, we experimented with a perceptual loss and an additional learned discriminator to constrain the masked area in other views, which yielded some improvement. However, we opted not to include these 2D constraints, since we found that the existing pipeline already produces favorable results.
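For what it's worth, the masked perceptual constraint we tried looked roughly like the sketch below (illustrative only; it assumes the standard `lpips` package, and the exact formulation in our experiments may have differed):

```python
import torch
import lpips  # pip install lpips; standard LPIPS perceptual metric

lpips_fn = lpips.LPIPS(net='vgg')  # VGG-based perceptual distance

def masked_perceptual_loss(render, inpainted, mask):
    """Perceptual loss restricted to the masked area of a novel view.

    render, inpainted: (1, 3, H, W) tensors scaled to [-1, 1]
    mask:              (1, 1, H, W) binary mask, 1 inside the removed region
    """
    # Replace everything outside the mask with the target, so the
    # perceptual distance is driven only by the masked area.
    blended = mask * render + (1 - mask) * inpainted
    return lpips_fn(blended, inpainted).mean()
```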