Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes

A semantic segmentation model based on ResNet, with a shift in perspective: the skip-connection path is treated as the main stream.

We propose a novel ResNet-like architecture that exhibits strong localization and recognition performance. We combine multi-scale context with pixel-level accuracy by using two processing streams within our network: one stream carries information at the full image resolution, enabling precise adherence to segment boundaries. The other stream undergoes a sequence of pooling operations to obtain robust features for recognition. Without additional processing steps and without pre-training, our approach achieves an intersection-over-union score of 71.8% on the Cityscapes dataset.
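To make the two-stream idea concrete, here is a toy 1-D sketch of how a full-resolution stream and a pooled stream might exchange information. It is not the paper's implementation: the function names (`two_stream_unit`, `avg_pool`, `upsample`), the average pooling, the nearest-neighbour upsampling, and the fixed scalar `weight` standing in for learned convolutions are all assumptions made for illustration.

```python
# Toy 1-D sketch of a two-stream residual unit (illustrative only).
# Assumptions: average pooling, nearest-neighbour upsampling, and a fixed
# scalar `weight` replace the learned convolutions of a real network.

def avg_pool(signal, factor):
    """Downsample by averaging non-overlapping windows of length `factor`."""
    return [sum(signal[i:i + factor]) / factor
            for i in range(0, len(signal), factor)]


def upsample(signal, factor):
    """Nearest-neighbour upsampling: repeat each value `factor` times."""
    return [value for value in signal for _ in range(factor)]


def relu(signal):
    """Element-wise rectified linear unit."""
    return [max(0.0, value) for value in signal]


def two_stream_unit(full_res, pooled, factor, weight=0.5):
    """One hypothetical unit coupling the two streams.

    full_res: stream kept at the full input resolution
    pooled:   stream at 1/factor of that resolution
    """
    # Bring the full-resolution stream down to the pooled resolution
    # and mix it with the pooled stream.
    full_res_small = avg_pool(full_res, factor)
    new_pooled = relu([weight * (p + f)
                       for p, f in zip(pooled, full_res_small)])

    # Upsample the result and add it to the full-resolution stream,
    # so that stream is only ever modified by residual updates.
    update = upsample(new_pooled, factor)
    new_full_res = [f + u for f, u in zip(full_res, update)]
    return new_full_res, new_pooled


full_res = [1.0, 3.0, 2.0, 2.0]
pooled = [0.0, -6.0]
new_full_res, new_pooled = two_stream_unit(full_res, pooled, factor=2)
print(new_full_res)  # [2.0, 4.0, 2.0, 2.0]
print(new_pooled)    # [1.0, 0.0]
```

In this sketch the full-resolution stream never changes size and receives only additive updates, which matches the view of the skip-connection path as the main stream; the pooled stream is where the nonlinear feature computation happens.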

furukawa-ai / deeplearning_papers

Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes #63