guanfuchen / semseg

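A survey of commonly used semantic segmentation architectures, with code reproductions. Huawei Media Research Institute: inquiries are welcome about full-time positions or internships in image-text captioning, OCR, and image-video-text multimodal understanding and generation. 15757172165 https://guanfuchen.github.io/media/hw_zhaopin_20220724_tiny.jpg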

ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation #19

Open guanfuchen opened 6 years ago

guanfuchen commented 6 years ago

related paper

Abstract
The ability to perform pixel-wise semantic segmentation in real-time is of paramount importance in mobile applications. Recent deep neural networks aimed at this task have the disadvantage of requiring a large number of floating point operations and have long run-times that hinder their usability. In this paper, we propose a novel deep neural network architecture named ENet (efficient neural network), created specifically for tasks requiring low latency operation. ENet is up to 18× faster, requires 75× less FLOPs, has 79× less parameters, and provides similar or better accuracy to existing models. We have tested it on CamVid, Cityscapes and SUN datasets and report on comparisons with existing state-of-the-art methods, and the trade-offs between accuracy and processing time of a network. We present performance measurements of the proposed architecture on embedded systems and suggest possible software improvements that could make ENet even faster.
guanfuchen commented 6 years ago

model architecture

[2 images]

detail in table

[2 images]
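
As a rough illustration of why the network is so small, here is a minimal sketch of the shape and parameter arithmetic for the ENet initial block as described in the paper: a 3×3, stride-2 convolution with 13 filters concatenated with a 2×2 max pool of the input. The function names `conv_out` and `initial_block` are hypothetical, and the padding of 1 and the bias-free convolution are assumptions made so that the two branches line up; this is not code from this repo.

```python
def conv_out(size, kernel, stride, padding):
    """Output length of a convolution or pooling window along one axis."""
    return (size + 2 * padding - kernel) // stride + 1


def initial_block(in_ch, height, width, conv_filters=13):
    """Return ((channels, H, W), conv_params) for the ENet initial block.

    Assumptions (not stated in this thread): the 3x3 convolution uses
    padding 1 and no bias term.
    """
    # Convolution branch: 3x3 kernel, stride 2, padding 1 (assumed).
    conv_h = conv_out(height, 3, 2, 1)
    conv_w = conv_out(width, 3, 2, 1)
    # Pooling branch: 2x2 max pool, stride 2, no padding.
    pool_h = conv_out(height, 2, 2, 0)
    pool_w = conv_out(width, 2, 2, 0)
    if (conv_h, conv_w) != (pool_h, pool_w):
        # Happens for odd input sizes; the branches cannot be concatenated.
        raise ValueError("height and width must be even")
    # Channel-wise concatenation of both branches.
    out_ch = conv_filters + in_ch
    # Only the convolution has weights; max pooling has none.
    params = conv_filters * in_ch * 3 * 3
    return (out_ch, conv_h, conv_w), params


if __name__ == "__main__":
    shape, params = initial_block(3, 512, 512)
    print(shape, params)  # (16, 256, 256) 351
```

For a 3-channel 512×512 input this gives 16 feature maps at 256×256 from only 351 weights, so the input is halved in resolution almost for free before the bottleneck stages run.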

guanfuchen commented 6 years ago

network design choices

[7 images]

guanfuchen commented 6 years ago

results

[4 images]

guanfuchen commented 6 years ago

conclusion

image