mli0603 / stereo-transformer

Revisiting Stereo Depth Estimation From a Sequence-to-Sequence Perspective with Transformers. (ICCV 2021 Oral)
Apache License 2.0
656 stars 106 forks source link

how accurate the retification needed for sttr to work #48

Open ynjiun opened 2 years ago

ynjiun commented 2 years ago

I tried to run the inference using a stereo images captured by my stereo camera using pretrained models. But none of them showing reasonable depth prediction. However, if I use Kitti dataset, then it looks good. This makes me think that this might means:

  1. my stereo images retification is not good enough, if so, is there a way (or metric) to specify how good a rectification need to be to run sttr?
  2. or sttr is only good for the dataset it was trained for and not so good to run arbitray stereo images set?

Thank you for sharing your insight in advanced.

mli0603 commented 2 years ago

Hi @ynjiun

Thanks for reaching out. I assumed you are using the KITTI pretrained model? The KITTI pretrained model is severely overfitted to the small dataset. For better generalization, please use the scene flow trained model. Would you mind giving that a try?