num_classes in segmentation example

pytorch / vision

Datasets, Transforms and Models specific to Computer Vision

BSD 3-Clause "New" or "Revised" License

@datumbox will be able to correct me if I'm wrong here:

Since the number of pretraining classes is 21, should the target mask contain integers from 0 to 21, where 0 is the background?

Yes, with one correction: with 21 classes the valid labels run from 0 to 20, where 0 is the background.

Do we need to compute the loss over class 0 (background)?

I believe so - the ability to detect background over other classes is a required feature of the models, so it has to be explicitly encoded in some way.
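
To make this concrete, here is a minimal sketch of how the loss treats background and ignored pixels. It assumes a plain cross-entropy loss with `ignore_index=255`; the tensor shapes and the random logits are made up for illustration and stand in for real model outputs.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)

num_classes = 21  # background (0) plus 20 foreground classes
logits = torch.randn(2, num_classes, 4, 4)  # stand-in for model output (N, C, H, W)

# Targets hold class ids in [0, num_classes - 1].
target = torch.randint(0, num_classes, (2, 4, 4))
target[:, 0, :] = 255  # pixels to skip in the loss
target[:, 1, :] = 0    # background pixels, which DO count towards the loss

loss = F.cross_entropy(logits, target, ignore_index=255)

# Same value computed by hand: mean negative log-probability over
# every pixel whose label is not 255 (background included).
valid = target != 255
log_probs = F.log_softmax(logits, dim=1)
safe_target = target.clamp(max=num_classes - 1)  # avoid indexing with 255
picked = log_probs.gather(1, safe_target.unsqueeze(1)).squeeze(1)
manual = -picked[valid].mean()
```

Background is just another class index here, so it is penalised like any other class; only the 255 pixels are left out of the average.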

I see that here we are ignoring index 255, but I am not sure where 255 comes from.

I believe it comes from here: https://github.com/pytorch/vision/blob/6d85d74be6ac72b2ac3057d85d8ae0004fa0de3f/references/segmentation/coco_utils.py#L55-L56
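
If I'm reading the linked lines correctly, they mark pixels where several instance masks overlap as 255, so that ambiguous pixels are dropped from the loss. The snippet below is a small sketch of that idea, not a copy of the reference code; the mask layout and the class ids 5 and 12 are hypothetical.

```python
import torch

# Two binary instance masks on a 3x4 image (1 = pixel belongs to the instance).
masks = torch.zeros(2, 3, 4, dtype=torch.uint8)
masks[0, :, 0:2] = 1  # instance 0 covers columns 0-1
masks[1, :, 1:3] = 1  # instance 1 covers columns 1-2; column 3 is uncovered

cats = torch.tensor([5, 12], dtype=torch.uint8)  # hypothetical class id per instance

# Per pixel, take the class id of whichever instance covers it (0 if none).
target, _ = (masks * cats[:, None, None]).max(dim=0)

# Pixels covered by more than one instance are ambiguous: mark them as 255.
target[masks.sum(0) > 1] = 255
```

In this toy case every row of `target` ends up as class 5, then 255 for the overlapping column, then class 12, then 0 for the uncovered background column.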

num_classes in segmentation example #5822