Questions about embedding function and forward pass of the model

kxhit / vMAP

[CVPR 2023] vMAP: Vectorised Object Mapping for Neural Field SLAM

https://kxhit.github.io/vMAP

Other

337 stars 21 forks source link

Closed Taekbum closed 1 year ago

Taekbum commented 1 year ago

Hello, thank you for sharing the code for this amazing research!

I have two questions:

The initial parameter values for the layer that embeds 3D points (UniDirsEmbed) in the code are fixed.
During the forward pass of the NERF model, the embedded input is split into two parts (emb_size1, emb_size2) and placed in different locations. Could you explain about these two things? Would changing them according to the iMap settings yield similar performance?

Thank you!

kxhit commented 1 year ago

Hi thanks for your interest!

The UniDirsEmbed is adopted from MipNeRF-360's embedding, where the fixed matrix is the basis Eq. 18 from the paper. Any other positional embedding could work as well.
The emb_size2 is for inputting a slightly higher frequency only for the color head to get more details. It gives a small improvement from our experiments but should perform similarly.

Hope it helps!

Taekbum commented 1 year ago

Thank you for your kind explanation!