HKUST-3DV / DIM-SLAM

This is official repo for ICLR 2023 Paper "DENSE RGB SLAM WITH NEURAL IMPLICIT MAPS"
195 stars 11 forks source link

How to run with more frames? #2

Closed JunyuanDeng closed 1 year ago

JunyuanDeng commented 1 year ago

Thanks for sharing the code!

From my personal test, the default setting is to run for the first 15 frames. And the corresponding code is: self.init(15) in the 51 line of file dimslam.py.

How to run more frames? I simply change 15 to 150, but the results are terrible:

image

poptree commented 1 year ago

Hi,

The rebuttal on the openreview is helpful to answer this question. There is still a gap between DIM-SLAM and "nerf without pose" because DIM-SLAM relies on incremental sfm to recover both scenes and pose.

JunyuanDeng commented 1 year ago

Thanks for your reply!

Sorry, I'm still quite confused. So, is it mean that there has still gaps to run it for a full sequence of 2000 frames without pose input? Then what's the meaning of the tracking results shown in Table 2? Another question is whether the mapping results are based on the GT pose or not.

JunyuanDeng commented 1 year ago

I check the Table 2 (tracking result), does it means that the tracking process is only valid for datasets like fr1/desk, which has slow motion and small change of pose? And the mapping results on replica datasets is based a GT pose?

JunyuanDeng commented 1 year ago

Sorry, it's just an initialization step. Hope to see the full code!

poptree commented 1 year ago

So, is it mean that there has still gaps to run it for a full sequence of 2000 frames without pose input? Then what's the meaning of the tracking results shown in Table 2?

The mapping and tracking results are running on the whole sequence. As we mentioned in the implementation detail, the current version only contains the initialization of DIM-SLAM, while it is also the core part of the whole system. Following the system section in paper, you could easily extend the current sfm to the whole sequence. We will also release the code for the whole system.

When we talk about "nerf without pose" and slam, you should note that they are two different tasks. The recent Nope-Nerf is also based on the incremental sfm and it is far from "nerf without pose".

I check the Table 2 (tracking result), does it means that the tracking process is only valid for datasets like fr1/desk, which has slow motion and small change of pose? And the mapping results on replica datasets are based a GT pose?

We follow the imap and nice-slam to test DIM-SLAM on the same sequence. The mapping results on replica datasets are based the estimated pose.

JunyuanDeng commented 1 year ago

Thanks for your reply! Now I understand.