Predicted depth range and metric depth setting

dcharatan / flowmap

[3DV 2025] Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, Ayush Tewari, and Vincent Sitzmann

MIT License


The "exp" setting imposes no additional constraints on the minimum or maximum depth values. The outputs naturally fall between 0.01 and infinity. We don't apply any scale or shift adjustments beyond what you see here:

https://github.com/dcharatan/flowmap/blob/19ac72b78d010220bd9487553db4fb463ab317d4/flowmap/model/backbone/backbone_midas.py#L80-L84
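For readers who cannot open the link, here is a minimal sketch of how an exponential mapping with an additive floor yields the range described above. The function name `exp_depth`, the constant `MIN_DEPTH`, and the exact formula are assumptions made for illustration; they are not copied from `backbone_midas.py`, which may scale its input differently.

```python
import math

# Assumption: the 0.01 lower bound quoted in the reply acts as an additive floor.
MIN_DEPTH = 0.01


def exp_depth(raw: float) -> float:
    """Hypothetical "exp"-style mapping from a raw network output to depth.

    exp(raw) is always positive, so the result never drops below MIN_DEPTH,
    and it grows without bound as raw increases.
    """
    return math.exp(raw) + MIN_DEPTH


if __name__ == "__main__":
    # Very negative inputs approach the floor; large inputs blow up quickly.
    for raw in (-20.0, 0.0, 5.0, 20.0):
        print(f"raw={raw:+.1f} -> depth={exp_depth(raw):.6g}")
```

Under this assumed form, nothing caps the upper end of the range, which is why no separate maximum needs to be configured.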

The "exp" setting works better for random initialization, since it has no ReLU zeroing the gradient wherever its input is negative, as the "original" setting does. We use the "original" setting because that's what the pre-trained MiDaS network uses. If you want to train a new initialization checkpoint entirely from scratch, it might be worth exploring whether "exp" works better. Just make sure the values at initialization are reasonable (i.e., not extremely large because of the exp).
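The gradient argument can be illustrated with a small, dependency-free sketch. Both mappings below are hypothetical stand-ins rather than FlowMap's actual code: `relu_mapping` only mimics "a mapping that passes the raw output through a ReLU", and `exp_mapping` mimics "an exponential with a 0.01 floor". The finite-difference helper `numerical_grad` is likewise introduced only for this example.

```python
import math

EPS = 0.01  # assumed small constant; also the floor quoted for "exp"


def exp_mapping(raw: float) -> float:
    # Hypothetical "exp"-style mapping: smooth and strictly increasing.
    return math.exp(raw) + EPS


def relu_mapping(raw: float) -> float:
    # Hypothetical stand-in for a ReLU-based mapping (not the real "original").
    # max(raw, 0.0) is the ReLU; every negative input maps to the same value.
    return 1.0 / (max(raw, 0.0) + EPS)


def numerical_grad(fn, x: float, h: float = 1e-6) -> float:
    # Central finite difference approximation of d fn / d x.
    return (fn(x + h) - fn(x - h)) / (2.0 * h)


# A negative raw output, as random initialization can easily produce.
raw = -1.0
grad_exp = numerical_grad(exp_mapping, raw)
grad_relu = numerical_grad(relu_mapping, raw)

# Sanity check suggested in the reply: a large raw value at initialization
# makes the exp-based depth extremely large.
depth_at_large_init = exp_mapping(20.0)

if __name__ == "__main__":
    print("gradient through exp mapping: ", grad_exp)
    print("gradient through ReLU mapping:", grad_relu)
    print("exp depth for raw=20:         ", depth_at_large_init)
```

At a negative input the ReLU-based stand-in reports a zero gradient, so that pixel receives no learning signal, while the exponential version still does. The last value shows why the raw outputs at initialization should stay small when using an exponential.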

Predicted depth range and metric depth setting #28