Weakly-Segmentation

List of useful code and papers for weakly supervised Semantic/Instance/Panoptic/Few-Shot Segmentation

Weakly-Segmentation
- Top Work
  - By Dataset
    - PASCAL VOC2012
  - By Years
    - ICCV2019
- Resources
  - Tutorial
- Implementation
Related Tasks
Reading List
- Under Review
- Published
  - context
  - graph
  - bbox-level
  - webly
  - Saliency
  - localization
  - spp
  - affinity
  - region
  - network
  - regularizer
  - evaluation measure
  - architecture
  - generative adversarial
  - scene understanding
  - other useful
  - application
- Others

Top Work

By Dataset

PASCAL VOC2012

method	val	test	notes
DSRG_CVPR2018	61.4	63.2	deep seeded region growing, resnet-lfov|vgg-aspp
psa_CVPR2018	61.7	63.7	pixel affinity network, resnet38
MDC_CVPR2018	60.4	60.8	multi-dilated convolution, vgg-lfov
MCOF_CVPR2018	60.3	61.2	iterative, RegionNet(sppx), resnet-lfov
GAIN_CVPR2018	55.3	56.8
DCSP_BMVC2017	58.6	59.2	adversarial for saliency, and generate cues by cam+saliency(harmonic mean)
GuidedSeg_CVPR2017	55.7	56.7	saliency, TBD
BDWSS_CVPR2018	63.0	63.9	webly, filter+enhance
WebSeg_arxiv	63.1	63.3	webly(pure), Noise filter module
SeeNet_NIPS2018	63.1	62.8	based on DCSP
Graph_ECCV2018	63.6	64.5	graph partition
Graph_ECCV2018	64.5	65.6	use simple ImageNet dataset additionally
CIAN_CVPR2019	64.1	64.7	cross image affinity network
FickleNet_CVPR2019	64.9	65.3	use dropout (a generalization of dilated convolution)
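
The table does not name its metric; the val and test numbers are assumed here to be mean intersection-over-union (mIoU) in percent, the standard PASCAL VOC measure. A minimal pure-Python sketch of that metric follows. The function name `mean_iou` and the `ignore_index=255` default are illustrative choices, not taken from any of the listed repositories.

```python
def mean_iou(pred, target, num_classes, ignore_index=255):
    """Mean IoU over classes that occur in the prediction or the ground truth.

    pred and target are flat sequences of integer class ids of equal length.
    Pixels whose ground truth equals ignore_index are skipped (assumption).
    """
    # conf[t][p] counts pixels with ground truth t that were predicted as p.
    conf = [[0] * num_classes for _ in range(num_classes)]
    for p, t in zip(pred, target):
        if t == ignore_index:
            continue
        conf[t][p] += 1

    ious = []
    for c in range(num_classes):
        tp = conf[c][c]
        fp = sum(conf[t][c] for t in range(num_classes)) - tp
        fn = sum(conf[c]) - tp
        denom = tp + fp + fn
        if denom > 0:  # skip classes absent from both pred and target
            ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else 0.0


score = mean_iou(pred=[0, 0, 1, 1], target=[0, 1, 1, 255], num_classes=2)
```

Benchmark scores are normally computed from one confusion matrix accumulated over the whole split rather than averaged per image; passing the concatenated labels of all images to this function gives that behaviour.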

By Years

ICCV2019

Joint Learning of Saliency Detection and Weakly Supervised Semantic Segmentation
Self-Supervised Difference Detection for Weakly-Supervised Semantic Segmentation

CVPR2019

FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stochastic Inference

Resources

see this for more weakly supervised lists and resources.
see this for more semantic/instance/panoptic/video segmentation lists and resources.
see this for more implementations.
a good architecture summary paper: Learning a Discriminative Feature Network for Semantic Segmentation

Tutorial

Unsupervised Visual Learning Tutorial. CVPR 2018 [part 1] [part 2]
Weakly Supervised Learning for Computer Vision. CVPR 2018 [web] [part 1] [part 2]

Implementation

pytorch-segmentation-detection a library for dense inference and training of Convolutional Neural Networks, 68.0%

drn Dilated Residual Networks, 75.6%, possibly the best available semantic segmentation implementation in PyTorch

Detectron.pytorch A pytorch implementation of Detectron. Both training from scratch and inferring directly from pretrained Detectron weights are available. only for coco now

AdvSemiSeg Adversarial Learning for Semi-supervised Semantic Segmentation. heavily borrowed from a pytorch DeepLab implementation (Link)

PyTorch-ENet PyTorch implementation of ENet

tensorflow-deeplab-resnet Tensorflow implementation of deeplab-resnet(deeplabv2, resnet101-based): complete and detailed

tensorflow-deeplab-lfov Tensorflow implementation of deeplab-LargeFOV(deeplabv2, vgg16-based): complete and detailed

resnet38 Wider or Deeper: Revisiting the ResNet Model for Visual Recognition: implemented using MXNET

pytorch_deeplab_large_fov: deeplab v1

pytorch-deeplab-resnet DeepLab resnet v2 model in pytorch

DeepLab-ResNet-Pytorch Deeplab v3 model in pytorch

BDWSS Bootstrapping the Performance of Webly Supervised Semantic Segmentation

psa Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation

DSRG: Caffe, CAM and DRFI provided

SEC

original: Caffe
BDWSS: MXNET
SEC-tensorflow: tensorflow

Related Tasks

Few-shot segmentation

[ ] One-shot learning for semantic segmentation, BMVC2017
[ ] Conditional networks for few-shot semantic segmentation, ICLR2018 Workshop
[ ] Few-Shot Segmentation Propagation with Guided Networks, preprint
[ ] Few-Shot Semantic Segmentation with Prototype Learning, BMVC2018
[ ] Attention-based Multi-Context Guiding for Few-Shot Semantic Segmentation, AAAI2019
[ ] CANet: Class-Agnostic Segmentation Networks with Iterative Refinement and Attentive Few-Shot Learning, CVPR2019
[ ] One-Shot Segmentation in Clutter, ICML 2018

Weakly-supervised Instance Segmentation

[x] Weakly Supervised Instance Segmentation using Class Peak Response, CVPR2018
[ ] Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations, CVPR2019
[ ] Object Counting and Instance Segmentation with Image-level Supervision, CVPR2019
[x] Cyclic Guidance for Weakly Supervised Joint Detection and Segmentation, CVPR2019
[x] Where are the Masks: Instance Segmentation with Image-level Supervision, BMVC2019
[ ] Label-PEnet: Sequential Label Propagation and Enhancement Networks for Weakly Supervised Instance Segmentation, ICCV2019

Weakly-supervised Panoptic Segmentation

[ ] Weakly- and Semi-Supervised Panoptic Segmentation, ECCV2018

Reading List

Under Review

[ ] Gated CRF Loss for Weakly Supervised Semantic Image Segmentation
[ ] Closed-Loop Adaptation for Weakly-Supervised Semantic Segmentation
[ ] Harvesting Information from Captions for Weakly Supervised Semantic Segmentation
[ ] Consistency regularization and CutMix for semi-supervised semantic segmentation
[ ] Zero-shot Semantic Segmentation
[x] Self-supervised Scale Equivariant Network for Weakly Supervised Semantic Segmentation, proposes a scale-equivariant regularization.

Published

context

[x] Context Encoding for Semantic Segmentation: CVPR2018. use TEN
[ ] The Role of Context for Object Detection and Semantic Segmentation in the Wild: CVPR2014
[ ] Objects as Context for Detecting Their Semantic Parts: CVPR2018
[ ] Exploring context with deep structured models for semantic segmentation: TPAMI2017
[ ] dilated convolution
[ ] Deep TEN: Texture encoding network !!: CVPR2017. A global context vector, pooled from all spatial positions, can be concatenated to local features
[ ] Refinenet: Multi-path refinement networks for high-resolution semantic segmentation: CVPR2017. local features across different scales can be fused to encode global context
[x] Non-local neural networks: CVPR2018. a densely connected graph with pairwise edges between all pixels

graph

[ ] Associating Inter-Image Salient Instances for Weakly Supervised Semantic Segmentation: ECCV2018

bbox-level

Box-driven Class-wise Region Masking and Filling Rate Guided Loss for Weakly Supervised Semantic Segmentation, CVPR2019

webly

[x] Weakly Supervised Semantic Segmentation Based on Web Image Cosegmentation: BMVC2017, training model using masks of web images which are generated by cosegmentation
[ ] Webly Supervised Semantic Segmentation: CVPR2017
[x] Weakly Supervised Semantic Segmentation using Web-Crawled Videos: CVPR2017, learns a class-agnostic decoder(attention map -> binary mask), pseudo masks are generated from video frames by solving a graph-based optimization problem.
[x] Bootstrapping the Performance of Webly Supervised Semantic Segmentation: target + web domain, target model filters web images, refines masks by combining target and web masks.
[ ] Learning from Weak and Noisy Labels for Semantic Segmentation: TPAMI2017
[x] WebSeg: Learning Semantic Segmentation from Web Searches: arxiv, learns directly from web images retrieved by keyword. uses saliency and region(MCG with edge)
[x] STC: A Simple to Complex Framework for Weakly-supervised Semantic Segmentation: TPAMI 2017, three DCNN models (Initial, Enhanced, Powerful). initial mask(generated by saliency and label using simple images) -> initial model -> enhanced mask(generated using simple images) -> enhanced model -> powerful mask(generated using complex images) -> powerful model
- saliency cannot handle complex images, so BMVC2017 uses coseg instead

Saliency

[x] Exploiting Saliency for Object Segmentation from Image Level Labels: CVPR2017
[x] Discovering Class-Specific Pixels for Weakly-Supervised Semantic Segmentation: BMVC2017
- combine saliency(off-the-shelf) and CAM to get cues, using a harmonic mean function
- adapt CAM from head of Segmentation Network
- use erasing to get multiple objects' saliency
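
The harmonic-mean fusion of CAM and saliency noted above can be sketched as below. This is only an illustration, not the DCSP authors' code: it assumes both maps are already normalized to [0, 1] and uses a plain unweighted harmonic mean, and the names `harmonic_mean_cue` and `eps` are hypothetical.

```python
def harmonic_mean_cue(cam, saliency, eps=1e-8):
    """Fuse two same-sized 2-D maps (lists of rows) with a harmonic mean.

    A pixel scores high only when BOTH the class activation and the
    saliency are high; if either is zero the fused cue is zero.
    eps avoids division by zero where both inputs are zero.
    """
    return [
        [2.0 * c * s / (c + s + eps) for c, s in zip(cam_row, sal_row)]
        for cam_row, sal_row in zip(cam, saliency)
    ]


cam = [[1.0, 0.0], [0.5, 0.5]]
saliency = [[1.0, 1.0], [0.5, 1.0]]
cue = harmonic_mean_cue(cam, saliency)
```

Compared with an arithmetic mean, the harmonic mean is dominated by the smaller input, so salient regions that the classifier does not respond to are suppressed.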

localization

[x] Adversarial Complementary Learning for Weakly Supervised Object Localization, CVPR2018. two branches, remove high activations from feature map. code
[x] Tell me where to look: Guided Attention Inference Network, CVPR2018. soft erasing of the original image(CAM after sigmoid as attention) -> end2end training, forcing erased images to have zero activation
[x] Self-Erasing Network for Integral Object Attention, NIPS2018: prohibit attentions from spreading to unexpected background regions.
- cam -> ternary mask(attention, background, potential)
- self erasing only in attention + potential region(sign flip in background region instead of simply setting to 0)
- self-produced pseudo label for background region(difference to SPG: 1. pseudo label for background and attention 2. supervise low layer)
[x] Self-produced Guidance for Weakly-supervised Object localization, ECCV2018:
- self-supervised, uses a top-down framework for single-label classification problems. adds pixel-wise supervision when only image-level labels are available
- B1, B2 sharing
- bottom guide top inversely(B1+B2 -> C)
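
The soft erasing noted for the Guided Attention Inference Network entry (CAM passed through a sigmoid and used as an attention mask) can be sketched as follows. This is a simplified single-channel illustration under stated assumptions: the CAM is normalized to [0, 1], and the threshold `sigma` and scale `omega` values are hypothetical defaults, not values taken from the paper or its code.

```python
import math


def soft_mask(cam, sigma=0.5, omega=10.0):
    """Sigmoid-thresholded attention: close to 1 where cam > sigma, else close to 0."""
    return [[1.0 / (1.0 + math.exp(-omega * (a - sigma))) for a in row] for row in cam]


def soft_erase(image, cam, sigma=0.5, omega=10.0):
    """Suppress attended pixels: erased = image * (1 - mask).

    image and cam are same-sized 2-D lists. Because the mask is a smooth
    function of the CAM, the erasing stays differentiable when written
    with a tensor library, which is what permits end-to-end training.
    """
    mask = soft_mask(cam, sigma, omega)
    return [
        [px * (1.0 - m) for px, m in zip(img_row, mask_row)]
        for img_row, mask_row in zip(image, mask)
    ]


erased = soft_erase(image=[[1.0, 1.0]], cam=[[0.5, 1.0]])
```

The erased image would then be fed back through the classifier, with a loss pushing its score for the erased class toward zero.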

spp

[ ] Superpixel convolutional networks using bilateral inceptions
[x] Learning Superpixels with Segmentation-Aware Affinity Loss: good intro for superpixel algs.

affinity

[x] Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation: image-level, semantic affinity, learn a network to predict affinity
[x] Adaptive Affinity Field for Semantic Segmentation: ECCV2018, semantic affinity. add a pairwise term in seg loss(similarity metric: KL divergence), use an adversarial method to determine optimal neighborhood size

region

[ ] Region-Based Convolutional Networks for Accurate Object Detection and Segmentation
[ ] Simultaneous Detection and Segmentation, 2014
[ ] Feedforward semantic segmentation with zoom-out features: 2015

network

[ ] Learned Shape-Tailored Descriptors for Segmentation
[ ] Normalized Cut Loss for Weakly-Supervised CNN Segmentation
[ ] Fully Convolutional Adaptation Networks for Semantic Segmentation
[ ] Learning to Adapt Structured Output Space for Semantic Segmentation
[x] Semantic Segmentation with Reverse Attention: BMVC2017, addresses equal responses from multiple classes(confusion in boundary regions). adds a reverse branch that predicts the probability that a pixel doesn't belong to the corresponding class, and uses attention to combine the original and reverse branches
[x] Deep Clustering for Unsupervised Learning of Visual Features, ECCV2018. use k-means cluster assignments as supervision to update weights of network
[x] DEL: Deep Embedding Learning for Efficient Image Segmentation, IJCAI 2018. use spp embedding as init probs to do image segmentation
[x] Learning a Discriminative Feature Network for Semantic Segmentation, CVPR2018, Smooth Network: multi-scale+global context(FPN with channel attention), Border Network: focal loss for boundary. code?
[ ] Convolutional Simplex Projection Network for Weakly Supervised Semantic Segmentation: BMVC 2018
[ ] Decoders Matter for Semantic Segmentation: Data-Dependent Decoding Enables Flexible Feature Aggregation: CVPR2019

regularizer

evaluation measure

architecture

[ ] The Devil is in the Decoders, BMVC2017
[x] Dilated Residual Networks, CVPR2017. Dilated structure design for classification and localization.
[x] Understanding Convolution for Semantic Segmentation, WACV2018. hybrid dilated convolution(2-2-2 -> 1-2-3)
[x] Smoothed Dilated Convolutions for Improved Dense Prediction, KDD2018. separable and share conv(for smoothing) + dilated conv
[x] Deeplab v1, v2, v3, v3+
[ ] Learning Fully Dense Neural Networks for Image Semantic Segmentation, AAAI2019
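
The motivation for the hybrid dilated convolution change (2-2-2 -> 1-2-3) can be checked with a small 1-D experiment: stack three 3-tap dilated convolutions and list which input offsets can reach one output position. The helper `covered_offsets` is a hypothetical name, and the 1-D setting is a simplification of the 2-D case.

```python
def covered_offsets(dilations):
    """Input offsets seen by one output unit of stacked 3-tap dilated convs."""
    offsets = {0}
    for d in dilations:
        # Each layer reads positions -d, 0, +d relative to its own output.
        offsets = {o + k * d for o in offsets for k in (-1, 0, 1)}
    return sorted(offsets)


same_rate = covered_offsets([2, 2, 2])  # only even offsets: gridding
hybrid = covered_offsets([1, 2, 3])     # every offset from -6 to 6
```

Both stacks span the same range of 13 positions, but equal rates touch only 7 of them while the 1-2-3 schedule touches all 13, which is the gap-free coverage the hybrid design aims for.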

generative adversarial

[ ] Deep dual learning for semantic image segmentation: CVPR2017, image translation
[x] Semantic Segmentation using Adversarial Networks, NIPS2016 workshop
- add gan loss branch, Segnet as generator, D: GT mask or predicted mask
[x] Adversarial Learning for Semi-Supervised Semantic Segmentation: BMVC2018
- semi supervised: SegNet as G, FCN-type D(discriminate each location), use output of D as pseudo label for unlabeled data
[x] Semi and weakly Supervised Semantic Segmentation Using Generative Adversarial Network: ICCV2017, use SegNet as D, treat fake as new class
- weakly, use conditionalGan, pixel-level, image-level, generated data are included in loss. the performance gain shrinks as the amount of fully labeled data increases
[ ] generative adversarial learning towards Fast weakly supervised detection: CVPR2018
[x] Adaptive Affinity Field for Semantic Segmentation: ECCV2018, semantic affinity. add a pairwise term in seg loss(similarity metric: KL divergence), use an adversarial method to determine optimal neighborhood size

scene understanding

[ ] ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans
[ ] SeGAN: Segmenting and Generating the Invisible

other useful

[ ] Learning to Segment Every Thing: semi-supervised, weight transfer function (from bbox parameters to mask parameters)
[ ] Simple Does It: Weakly Supervised Instance and Semantic Segmentation: bbox-level, many methods, using graphcut, HED, MCG
[ ] Multi-Evidence Filtering and Fusion for Multi-Label Classification, Object Detection and Semantic Segmentation Based on Weakly Supervised Learning: tricky, curriculum learning: image level -> instance level -> pixel level
[ ] Combining Bottom-Up, Top-Down, and Smoothness Cues for Weakly Supervised Image Segmentation: CVPR2017
[x] Improving Weakly-Supervised Object Localization By Micro-Annotation: BMVC2016, some object classes always co-occur with the same background elements(boat, train), where CAM has problems. proposes a new annotation method: add human annotations to improve the localization results of CAM, annotating clusters of dense features. each class uses its own spectral clustering.
[x] Co-attention CNNs for Unsupervised Object Co-segmentation: IJCAI 2018
[ ] Coarse-to-fine Image Co-segmentation with Intra and Inter Rank Constraints, IJCAI2018
[ ] Annotation-Free and One-Shot Learning for Instance Segmentation of Homogeneous Object Clusters, IJCAI2018
[x] Image-level to Pixel-wise Labeling: From Theory to Practice: fully, analyzes the effect of image labels on seg results. adds a generator(recovers the original image). image label(binary, uses a threshold smaller than 0.5, e.g. 0.25), IJCAI2018

application

[x] SeGAN: Segmenting and Generating the Invisible: CVPR2018, generate occluded parts
[x] Learning Hierarchical Semantic Image Manipulation through Structured Representations: NIPS2018, manipulate image on object-level by modify bbox

Others

priors

Superpixels: An Evaluation of the State-of-the-Art link
Learning Superpixels with Segmentation-Aware Affinity Loss link
Superpixel based Continuous Conditional Random Field Neural Network for Semantic Segmentation link

diffusion

Learning random-walk label propagation for weakly-supervised semantic segmentation: scribble

Convolutional Random Walk Networks for Semantic Image Segmentation: fully, affinity branch(low level)

Soft Proposal Networks for Weakly Supervised Object Localization: attention, semantic affinity

Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation: image-level, semantic affinity
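
The entries above share one idea: spread sparse label scores to similar pixels by a random walk over a pixel affinity matrix. A minimal sketch follows, assuming a dense nonnegative affinity matrix whose rows all have a nonzero sum (for example because every pixel has affinity 1 with itself). The function names are hypothetical and this is not the exact formulation of any listed paper.

```python
def row_normalize(affinity):
    """Turn an affinity matrix into a random-walk transition matrix."""
    return [[w / sum(row) for w in row] for row in affinity]


def propagate(scores, affinity, steps=1):
    """Diffuse per-pixel scores: scores <- T @ scores, repeated `steps` times.

    scores is a list with one value per pixel (one class at a time);
    affinity[i][j] is the similarity between pixels i and j.
    """
    transition = row_normalize(affinity)
    for _ in range(steps):
        scores = [sum(t * s for t, s in zip(row, scores)) for row in transition]
    return scores


# Pixels 0 and 1 are similar to each other; pixel 2 is isolated.
affinity = [[1.0, 1.0, 0.0],
            [1.0, 1.0, 0.0],
            [0.0, 0.0, 1.0]]
spread = propagate([1.0, 0.0, 0.0], affinity, steps=2)
```

In this toy case the seed score on pixel 0 is shared with pixel 1 but never leaks to pixel 2, which is the behaviour wanted at object boundaries.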

analysis

image level to pixel wise labeling: from theory to practice: IJCAI 2018, analyzes the effectiveness of class-level labels for segmentation(GT, predicted)
Attention based Deep Multiple Instance Learning: ICML 2018. CAM from the MIL perspective

post processing

listed in: Co-attention CNNs for Unsupervised Object Co-segmentation

Otsu’s method
GrabCut
CRF
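
Of the three post-processing options, Otsu's method is the simplest to show in full: it picks the threshold that maximizes the between-class variance of a score map. A minimal sketch, assuming the map has been quantized to integers in 0..255 and flattened; `otsu_threshold` is an illustrative name, not a call from any listed repository.

```python
def otsu_threshold(values, levels=256):
    """Return t such that splitting into {v <= t} and {v > t}
    maximizes the between-class variance."""
    hist = [0] * levels
    for v in values:
        hist[v] += 1

    total = len(values)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0      # number of pixels at or below the candidate threshold
    sum0 = 0    # sum of their intensities
    for t in range(levels):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue          # no background pixels yet
        w1 = total - w0
        if w1 == 0:
            break             # no foreground pixels left
        mu0 = sum0 / w0
        mu1 = (sum_all - sum0) / w1
        between = w0 * w1 * (mu0 - mu1) ** 2
        if between > best_var:
            best_var, best_t = between, t
    return best_t


scores = [10] * 5 + [200] * 5
t = otsu_threshold(scores)
mask = [v > t for v in scores]
```

GrabCut and CRF refinement additionally use the image's colour and edge information, so they are usually applied after a coarse mask like this one is available.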

common methods

refine segmentation results using image-level labels
multi-label classification branch(BDWSS)
generative branch(to original image)
crf
