Single view depth has limitation of generalizing

tinghuiz / SfMLearner

An unsupervised learning framework for depth and ego-motion estimation from monocular videos

MIT License

1.97k stars 557 forks source link

I agree. Single-view and stereo depth estimation are two very different problems. For single-view, the model can only rely on semantic cues specific to the training set. Therefore, it's unreasonable to expect the model to generalize to test scenes that have dramatically different statistics than the training scenes (e.g. a model trained on outdoor driving sequences is unlikely to work well on indoor scenes or the test images are rotated 180 degrees while training images are not). On the other hand, the stereo model can utilize not only the semantic cues but also geometric cues for depth estimation, which are much more generalizable.

tinghuiz / SfMLearner

Single view depth has limitation of generalizing #50