Open Yuhuoo opened 2 hours ago
Hi @Yuhuoo,
Thanks for your feedback. As mentioned in #3, our visualization is based on the ground truth motion mask of DAVIS.
To get a better visualization when the motion mask is noisy, you could simply add the `--no_mask` flag to the visualization command (i.e., `python viser/visualizer_monst3r.py --data path/schoolgirls --no_mask`), which avoids stacking all the background point clouds.
Alternatively, you could add the `--use_gt_mask` flag to the evaluation script (see https://github.com/Junyi42/monst3r?tab=readme-ov-file#evaluation), which should generate results similar to our online visualization.
Thanks.
Why is the result I reproduced worse than the one shown in your report? Here is the result of my reproduction:
https://github.com/user-attachments/assets/4b30d0b0-2070-4674-a65f-7d6d6e6b2d41