In the paper you state "Stereo networks generalize much better and have smaller synthetic-to-real domain transfer problems."
When I ran `release-StereoNoFt.ckpt` on a Cityscapes stereo image pair, it produced the following result (the images were resized to 1024x512 using `cv2.INTER_AREA`):
On KITTI, however, the result is pretty good (image size 1280x384).
The monocular model `release-StereoUnsupFt-Mono-pt.ckpt` shows the same behavior: poor on Cityscapes but good on KITTI. I also tried cropping out the car hood, but it did not help much.
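For anyone trying to reproduce this, the preprocessing described above can be sketched roughly as follows. This is only a minimal sketch under stated assumptions: the helper names `hood_crop_rows` and `preprocess` are hypothetical, and the `hood_fraction` value of 0.2 is a guess at how much of the bottom of a Cityscapes frame the car hood occupies, not a value taken from the paper or the repository. Only `cv2.resize` with `cv2.INTER_AREA` and the 1024x512 target size come from the description above.

```python
def hood_crop_rows(height, hood_fraction=0.2):
    """Return how many rows remain after dropping the bottom hood region.

    hood_fraction is an assumed value, not one taken from the paper.
    """
    if not 0.0 <= hood_fraction < 1.0:
        raise ValueError("hood_fraction must be in [0, 1)")
    return height - int(round(height * hood_fraction))


def preprocess(image, size=(1024, 512), hood_fraction=0.2):
    """Crop the car hood from the bottom of the image, then downscale.

    image: H x W x C array as returned by cv2.imread.
    size:  (width, height) target, in the order cv2.resize expects.
    """
    import cv2  # imported here so the crop helper works without OpenCV

    keep = hood_crop_rows(image.shape[0], hood_fraction)
    cropped = image[:keep]  # keep the top rows, drop the hood
    return cv2.resize(cropped, size, interpolation=cv2.INTER_AREA)
```

The same crop and resize would have to be applied to both the left and the right image so that the pair stays row-aligned. Note that cropping before resizing to a fixed 1024x512 changes the aspect ratio, and that any horizontal rescaling also rescales the disparities, which may matter when comparing against Figure 7.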
Could you please provide a guide on how to reproduce the results shown in Figure 7?