guanfuchen / semseg

常用的语义分割架构结构综述以及代码复现 华为媒体研究院 图文Caption、OCR识别、图视文多模态理解与生成相关方向工作或实习欢迎咨询 15757172165 https://guanfuchen.github.io/media/hw_zhaopin_20220724_tiny.jpg
770 stars 164 forks source link

Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network #51

Open guanfuchen opened 5 years ago

guanfuchen commented 5 years ago

related paper

摘要
One of recent trends [30, 31, 14] in network architecture design is stacking small filters (e.g., 1x1 or 3x3) in the entire network because the stacked small filters is more efficient than a large kernel, given the same computational complexity. However, in the field of semantic segmentation, where we need to perform dense per-pixel prediction, we find that the large kernel (and effective receptive field) plays an important role when we have to perform the classification and localization tasks simultaneously. Following our design principle, we propose a Global Convolutional Network to address both the classification and localization issues for the semantic segmentation. We also suggest a residual-based boundary refinement to further refine the object boundaries. Our approach achieves state-of-art performance on two public benchmarks and significantly outperforms previous results, 82.2% (vs 80.2%) on PASCAL VOC 2012 dataset and 76.9% (vs 71.8%) on Cityscapes dataset.
guanfuchen commented 5 years ago

image

image

image

image

image

image

guanfuchen commented 5 years ago

image

image

image

image

image

guanfuchen commented 5 years ago

conclusions

image

guanfuchen commented 5 years ago

mIoU on val=0.64

image

image

image

image