Open AIGR-sw opened 1 year ago
Any update on this issue?
We're sorry for the late response. We've been dealing with some recent deadlines that have kept us busy. Thank you for your interest in our work, and we apologize for any confusion caused by the issues you raised.
1) Thank you for bringing this to our attention. You are correct: while the two stem blocks in our model ( self.stem_block_depth and self.stem_block_rgb ) do not share weights, the remaining part of the network does indeed, as you have observed in our Python implementation of the backbone in vgg.py. We have addressed this issue in a recent extension of our work (which is currently under submission) in which we adopted different backbones while ensuring they do not share weights.
2) Thank you for your comment regarding the network architecture described in our supplementary material. After reviewing our implementation, we acknowledge that the description in the supplementary material is inaccurate, as we wrongly reported the inputs to the layers conv1_o and conv2_o.
3) We have verified that our implementation follows this definition and we are confident in the correctness of our results. In particular, we define disparity as the sum of the classification score and the offset (line 66-71 in refiner.py), and if the absolute difference between this value and the target ground-truth disparity is within the range of [-1,1], we include it in the final loss (line 115 in refiner.py).
If you have any further concerns or questions, please do not hesitate to let us know.
Thank you for your work!
I have noticed several inconsistencies between your paper and the Python implementation in this Github repository. Specifically, I have identified the following three inconsistencies:
vgg13_rgb = torchvision.models.vgg13(pretrained=False) self.downsample_2_rgb = vgg13_rgb.features[4:9] self.downsample_4_rgb = vgg13_rgb.features[9:14] self.downsample_8_rgb = vgg13_rgb.features[14:19] self.downsample_16_rgb = vgg13_rgb.features[19:24]