regarding the number of classes and data augmentation mechanism

ljanyst / image-segmentation-fcn

Semantic Image Segmentation using a Fully Convolutional Neural Network in TensorFlow


It is, in fact, a binary problem. I think some of these labels are simply broken. If you're not careful, though, the broken labels may cause numerical instabilities in the model. See this note from the TensorFlow docs:

NOTE: While the classes are mutually exclusive, their probabilities need not be. All that is required is that each row of labels is a valid probability distribution. If they are not, the computation of the gradient will be incorrect.

I even wrote a script to check whether you always end up with a proper class probability distribution in your labels. Just treat everything that is not purple as background.
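A check along these lines might look like the sketch below. It is not the script mentioned above, only a minimal illustration: the RGB value used for "purple", the two-channel layout, and the function names are all assumptions made for the example.

```python
import numpy as np

# Assumed RGB value for "purple"; adjust it to match the actual label images.
PURPLE = np.array([255, 0, 255], dtype=np.uint8)


def labels_to_one_hot(label_rgb):
    """Map an (H, W, 3) RGB label image to an (H, W, 2) one-hot array.

    Channel 0 is background (anything that is not purple),
    channel 1 is the purple class.
    """
    is_purple = np.all(label_rgb == PURPLE, axis=-1)
    return np.stack([~is_purple, is_purple], axis=-1).astype(np.float32)


def is_valid_distribution(one_hot, tol=1e-6):
    """Return True if every pixel's class vector is non-negative and sums to 1."""
    sums = one_hot.sum(axis=-1)
    return bool(np.all(one_hot >= 0) and np.all(np.abs(sums - 1.0) <= tol))


if __name__ == "__main__":
    # Tiny hand-made label image: purple, red, black, purple.
    label = np.array(
        [[[255, 0, 255], [255, 0, 0]],
         [[0, 0, 0], [255, 0, 255]]],
        dtype=np.uint8,
    )
    one_hot = labels_to_one_hot(label)
    print(is_valid_distribution(one_hot))
```

Because every pixel is assigned to exactly one of the two channels, the resulting labels always form a proper distribution, regardless of which stray colors appear in the original label image.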

I did not do any augmentation. The original paper says on page 7:

Augmentation. We tried augmenting the training data by randomly mirroring and “jittering” the images by translating them up to 32 pixels (the coarsest scale of prediction) in each direction. This yielded no noticeable improvement.
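For reference, the mirroring and jittering described in that passage could be sketched roughly as follows. This is not code from this repository or from the paper; the function names, the zero fill for exposed pixels, and the assumption that the label is an integer class-index mask with 0 as background are all illustrative choices.

```python
import numpy as np


def shift(arr, dy, dx, fill=0):
    """Translate a 2-D or 3-D array by (dy, dx) pixels, padding with `fill`."""
    out = np.full_like(arr, fill)
    h, w = arr.shape[:2]
    dy, dx = int(dy), int(dx)
    if abs(dy) >= h or abs(dx) >= w:
        return out  # shifted completely out of frame
    src_y = slice(max(0, -dy), min(h, h - dy))
    dst_y = slice(max(0, dy), min(h, h + dy))
    src_x = slice(max(0, -dx), min(w, w - dx))
    dst_x = slice(max(0, dx), min(w, w + dx))
    out[dst_y, dst_x] = arr[src_y, src_x]
    return out


def mirror_and_jitter(image, label, max_shift=32, rng=None):
    """Randomly mirror horizontally and translate image and label together.

    `label` is assumed to be an (H, W) integer class mask where 0 means
    background, so pixels exposed by the shift become background.
    """
    rng = np.random.default_rng() if rng is None else rng
    if rng.random() < 0.5:
        image = image[:, ::-1]
        label = label[:, ::-1]
    dy, dx = rng.integers(-max_shift, max_shift + 1, size=2)
    return shift(image, dy, dx), shift(label, dy, dx)
```

The same flip and offset are applied to the image and its label so the two stay aligned. If the labels were one-hot instead of class indices, the fill would have to set the background channel to 1 so that every pixel still carries a valid distribution.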


regarding the number of classes and data augmentation mechanism #2