guanfuchen / semseg

A survey of commonly used semantic segmentation architectures, with code reproductions.

Speeding up Semantic Segmentation for Autonomous Driving #20

Open guanfuchen opened 5 years ago

guanfuchen commented 5 years ago

related paper

Abstract
Deep learning has considerably improved semantic image segmentation. However, its high accuracy is traded against larger computational costs which makes it unsuitable for embedded devices in self-driving cars. We propose a novel deep network architecture for image segmentation that keeps the high accuracy while being efficient enough for embedded devices. The architecture consists of ELU activation functions, a SqueezeNet-like encoder, followed by parallel dilated convolutions, and a decoder with SharpMask-like refinement modules. On the Cityscapes dataset, the new network achieves higher segmentation accuracy than other networks that are tailored to embedded devices. Simultaneously the frame-rate is still sufficiently high for the deployment in autonomous vehicles.

guanfuchen commented 5 years ago

overview

image

architecture detail

image

encoder design

image
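
The abstract names ELU activations as one ingredient of the network. As a minimal sketch of what that activation computes, compared with ReLU: the `alpha = 1.0` default below is an assumption (a common choice), not a value confirmed anywhere in this thread.

```python
import math


def elu(x: float, alpha: float = 1.0) -> float:
    """ELU: identity for positive inputs, alpha * (exp(x) - 1) otherwise.

    alpha = 1.0 is an assumed default, not taken from the paper notes.
    """
    return x if x > 0 else alpha * (math.exp(x) - 1.0)


def relu(x: float) -> float:
    """ReLU, shown only for comparison."""
    return max(0.0, x)


if __name__ == "__main__":
    # Negative inputs saturate towards -alpha instead of being clipped to 0.
    for v in (-2.0, -0.5, 0.0, 1.5):
        print(f"x={v:+.1f}  relu={relu(v):.4f}  elu={elu(v):.4f}")
```

Unlike ReLU, ELU keeps a small non-zero output for negative inputs, which is bounded below by `-alpha`.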

dilated convolution design

image
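
The abstract describes the encoder as being followed by parallel dilated convolutions. The sketch below illustrates the idea in one dimension with plain Python. The kernel, the dilation rates, and the choice to merge branches by summation are all assumptions made for illustration; the notes here do not state how the paper configures or merges its branches.

```python
def dilated_conv1d(signal, kernel, dilation):
    """Zero-padded, same-length 1D dilated convolution (cross-correlation).

    Assumes an odd kernel length so the kernel is centred on each sample.
    """
    k = len(kernel)
    centre = k // 2
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for j, w in enumerate(kernel):
            idx = i + (j - centre) * dilation  # taps are `dilation` samples apart
            if 0 <= idx < len(signal):
                acc += w * signal[idx]
        out.append(acc)
    return out


def receptive_field(kernel_size, dilation):
    """Number of input samples spanned by one dilated kernel."""
    return (kernel_size - 1) * dilation + 1


def parallel_dilated(signal, kernel, dilations):
    """Run one branch per dilation rate and sum the branches (assumed merge)."""
    branches = [dilated_conv1d(signal, kernel, d) for d in dilations]
    return [sum(vals) for vals in zip(*branches)]


if __name__ == "__main__":
    impulse = [0, 0, 0, 0, 1, 0, 0, 0, 0]
    print(dilated_conv1d(impulse, [1, 1, 1], dilation=2))
    print(receptive_field(3, 2))
    print(parallel_dilated(impulse, [1, 1, 1], dilations=(1, 2)))
```

A larger dilation widens the receptive field without adding weights, which is why several rates in parallel can capture context at multiple scales at low cost.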

decoder and bypass design

image

guanfuchen commented 5 years ago

training

image

result

image

image

guanfuchen commented 5 years ago

conclusion

image

guanfuchen commented 5 years ago

The mIoU on the validation set is 0.57.
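
For reference, a minimal sketch of how a mean intersection-over-union (mIoU) score like the one above is typically computed from per-pixel labels. The helper names are hypothetical, and skipping classes that never occur in either the ground truth or the prediction is an assumed convention; the evaluation code behind the reported number may differ.

```python
def confusion_matrix(truth, pred, num_classes):
    """Count (true class, predicted class) pairs over flattened label lists.

    Ground-truth labels outside [0, num_classes) are skipped, which is one
    common way to handle an "ignore" label (an assumption here).
    """
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(truth, pred):
        if 0 <= t < num_classes:
            matrix[t][p] += 1
    return matrix


def mean_iou(confusion):
    """Average per-class IoU = TP / (TP + FP + FN) over classes that occur."""
    n = len(confusion)
    ious = []
    for c in range(n):
        tp = confusion[c][c]
        fn = sum(confusion[c]) - tp                        # missed pixels of class c
        fp = sum(confusion[r][c] for r in range(n)) - tp   # pixels wrongly labelled c
        denom = tp + fp + fn
        if denom > 0:
            ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else 0.0


if __name__ == "__main__":
    truth = [0, 0, 0, 0, 1, 1, 1, 1, 1, 1]
    pred = [0, 0, 0, 1, 0, 1, 1, 1, 1, 1]
    print(mean_iou(confusion_matrix(truth, pred, num_classes=2)))
```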

image

image

image

image