SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

guanfuchen / semseg

常用的语义分割架构结构综述以及代码复现华为媒体研究院图文Caption、OCR识别、图视文多模态理解与生成相关方向工作或实习欢迎咨询 15757172165 https://guanfuchen.github.io/media/hw_zhaopin_20220724_tiny.jpg

768 stars 164 forks source link

Open guanfuchen opened 5 years ago

guanfuchen commented 5 years ago

use the code for training CamVid, here is the sample result

when the arch segnet-vgg16 change to segnet-vgg19, the detail about pedestrian is more clear.

guanfuchen commented 5 years ago

Training Skills:

We perform local contrast normalization [54] to the RGB input.
We use median frequency balancing [14] where the weight assigned to a class in the loss function is the ratio of the median of class frequencies computed on the entire training set divided by the class frequency.

** local contrast normalization