daijifeng001 / MNC

Instance-aware Semantic Segmentation via Multi-task Network Cascades

ROI Warping output width and height incorrect #15

Open hgaiser opened 8 years ago

hgaiser commented 8 years ago

I'm not sure whether the paper or the model here on GitHub is correct, but there is a discrepancy between the two. The code sets the pooled width and height of the RoI warping layer to 14x14, whereas the paper says it should be 28x28:

We expect the RoI warping layer to produce a sufficiently fine resolution, which is set as W' × H' = 28 × 28 in this paper. A max pooling layer is then applied to produce a lower-resolution output, e.g., 7×7 for VGG-16.

Am I interpreting something wrong, or is there a reason for this?

hgaiser commented 8 years ago

Judging by https://github.com/daijifeng001/MNC/commit/e2d820f6f0c71fd753db090a427981d3ef94885e, it should have been 28x28, followed by a max pooling layer?
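
To make the size question concrete, here is a rough sketch of the arithmetic in plain Python. It is not taken from the MNC code. The helper names `pooled_size` and `max_pool_2d` are hypothetical, and the kernel/stride values (4/4 and 2/2) are assumptions chosen only because they reduce 28x28 and 14x14 to the 7x7 that the paper mentions for VGG-16. The output-size formula assumes Caffe-style ceil rounding with no padding.

```python
import math


def pooled_size(in_size, kernel, stride):
    """Pooling output size with ceil rounding and no padding (Caffe-style assumption)."""
    return int(math.ceil((in_size - kernel) / stride)) + 1


def max_pool_2d(grid, kernel, stride):
    """Naive max pooling over a 2D list of numbers; assumes stride <= kernel."""
    h, w = len(grid), len(grid[0])
    out_h = pooled_size(h, kernel, stride)
    out_w = pooled_size(w, kernel, stride)
    out = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            r0, c0 = i * stride, j * stride
            # Clip the window at the border, as ceil rounding can overhang.
            window = [
                grid[r][c]
                for r in range(r0, min(r0 + kernel, h))
                for c in range(c0, min(c0 + kernel, w))
            ]
            row.append(max(window))
        out.append(row)
    return out


# Stand-in for a 28x28 RoI-warped feature map (values are just indices).
warped = [[r * 28 + c for c in range(28)] for r in range(28)]

# Assumed kernel/stride: 28x28 -> 7x7 needs a 4x downsampling.
pooled = max_pool_2d(warped, kernel=4, stride=4)
print(len(pooled), len(pooled[0]))  # 7 7

# A 14x14 warp would need only a 2x downsampling to reach 7x7.
print(pooled_size(14, kernel=2, stride=2))  # 7
```

Under these assumptions both settings end at 7x7, so the difference between 14x14 and 28x28 lies in how finely the RoI is sampled before max pooling, not in the final feature size.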