DeepLab is a state-of-art deep learning system for semantic image segmentation built on top of Caffe.
It combines (1) atrous convolution to explicitly control the resolution at which feature responses are computed within Deep Convolutional Neural Networks, (2) atrous spatial pyramid pooling to robustly segment objects at multiple scales with filters at multiple sampling rates and effective fields-of-views, and (3) densely connected conditional random fields (CRF) as post processing.
This distribution provides a publicly available implementation for the key model ingredients reported in our latest arXiv paper. This version also supports the experiments (DeepLab v1) in our ICLR'15. You only need to modify the old prototxt files. For example, our proposed atrous convolution is called dilated convolution in CAFFE framework, and you need to change the convolution parameter "hole" to "dilation" (the usage is exactly the same). For the experiments in ICCV'15, there are some differences between our argmax and softmax_loss layers and Caffe's. Please refer to DeepLabv1 for details.
Please consult and consider citing the following papers:
@article{CP2016Deeplab,
title={DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs},
author={Liang-Chieh Chen and George Papandreou and Iasonas Kokkinos and Kevin Murphy and Alan L Yuille},
journal={arXiv:1606.00915},
year={2016}
}
@inproceedings{CY2016Attention,
title={Attention to Scale: Scale-aware Semantic Image Segmentation},
author={Liang-Chieh Chen and Yi Yang and Jiang Wang and Wei Xu and Alan L Yuille},
booktitle={CVPR},
year={2016}
}
@inproceedings{CB2016Semantic,
title={Semantic Image Segmentation with Task-Specific Edge Detection Using CNNs and a Discriminatively Trained Domain Transform},
author={Liang-Chieh Chen and Jonathan T Barron and George Papandreou and Kevin Murphy and Alan L Yuille},
booktitle={CVPR},
year={2016}
}
@inproceedings{PC2015Weak,
title={Weakly- and Semi-Supervised Learning of a DCNN for Semantic Image Segmentation},
author={George Papandreou and Liang-Chieh Chen and Kevin Murphy and Alan L Yuille},
booktitle={ICCV},
year={2015}
}
@inproceedings{CP2015Semantic,
title={Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs},
author={Liang-Chieh Chen and George Papandreou and Iasonas Kokkinos and Kevin Murphy and Alan L Yuille},
booktitle={ICLR},
year={2015}
}
Note that if you use the densecrf implementation, please consult and cite the following paper:
@inproceedings{KrahenbuhlK11,
title={Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials},
author={Philipp Kr{\"{a}}henb{\"{u}}hl and Vladlen Koltun},
booktitle={NIPS},
year={2011}
}
DeepLabv2 currently achieves 79.7% on the challenging PASCAL VOC 2012 semantic image segmentation task -- see the leaderboard.
Please refer to our project website for details.
We have released several trained models and corresponding prototxt files at here. Please check it for more model details.
Check FAQ if you have some problems while using the code.
There are several variants of DeepLab. To begin with, we suggest DeepLab-LargeFOV, which has good performance and faster training time.
Suppose the codes are located at deeplab/code
Seyed Ali Mousavi has implemented a python version of run_pascal.sh (Thanks, Ali!). If you are more familiar with Python, you may want to take a look at this.