Performance of `StereoNoFt` on Cityscapes

In the paper you state "Stereo networks generalize much better and have smaller synthetic-to-real domain transfer problems."

When I tried release-StereoNoFt.ckpt on a Cityscapes stereo image pair, it produces the following result: (the image is resized to 1024x512 using cv2.INTER_AREA) But on KITTI it's pretty good (image size 1280x384)

The monocular model release-StereoUnsupFt-Mono-pt.ckpt has the same phenomenon: bad on Cityscapes but good on KITTI. I also tried removing the car hood but without much improvement. Could you please give us a guide on how to reproduce the result of Figure 7?

xy-guo / Learning-Monocular-Depth-by-Stereo

Performance of `StereoNoFt` on Cityscapes #9