MendelXu / SAN

Open-vocabulary Semantic Segmentation
https://mendelxu.github.io/SAN/
MIT License

train and test on my own dataset #49

Closed ChunmengLiu1 closed 4 months ago

ChunmengLiu1 commented 7 months ago

Hi! I ran into some problems when switching to my own dataset. My dataset has 4 foreground classes and 1 background class. Following other issues, I registered it in ./san/data/datasets/register.py and __init__.py. I want to compute the mIoU over both the foreground and the background classes.

  1. I set CLASS_NAMES=(background, ...) in register.py (i.e. including background) and set MODEL.SAN.NUM_CLASSES 5, without changing mask_cls=F.softmax(mask_cls, dim=-1)[..., :-1]. Do you think the mIoU computed this way is reasonable? (A sketch of this registration is given after this list.)

  2. I don't understand why the output of mask_cls=F.softmax(mask_cls, dim=-1) has 6 channels in its last dimension (6 classes), and why you then drop the last one with [..., :-1]. Does this have something to do with the ignore value 255?

  3. While setting up my dataset I found the RGB image root and the semantic segmentation ground-truth root, but where is the class label root? Do the labels come from image-level labels or from the semantic segmentation ground truths? Thanks!
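
For reference, a minimal sketch of the registration described in 1., assuming the detectron2 DatasetCatalog/MetadataCatalog pattern used by the existing register_*.py files; the dataset name, paths, and class names below are placeholders, not the ones in this repository:

```python
from detectron2.data import DatasetCatalog, MetadataCatalog
from detectron2.data.datasets import load_sem_seg

# Index 0 is "background", indices 1..4 are the four foreground classes (placeholder names).
CLASS_NAMES = ("background", "class_a", "class_b", "class_c", "class_d")


def register_my_dataset(root="datasets/my_dataset"):
    image_dir = f"{root}/images"    # RGB images
    gt_dir = f"{root}/annotations"  # PNGs whose pixel values are class ids 0..4 (255 = ignore)
    name = "my_dataset_sem_seg"
    DatasetCatalog.register(
        name,
        lambda x=image_dir, y=gt_dir: load_sem_seg(y, x, gt_ext="png", image_ext="jpg"),
    )
    MetadataCatalog.get(name).set(
        stuff_classes=list(CLASS_NAMES),
        image_root=image_dir,
        sem_seg_root=gt_dir,
        evaluator_type="sem_seg",
        ignore_label=255,  # pixels with value 255 are excluded from the loss and from mIoU
    )
```

With five names registered this way, MODEL.SAN.NUM_CLASSES 5 then matches len(CLASS_NAMES).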

MendelXu commented 6 months ago
  1. I think it is reasonable. A potential problem, though, is that the name "background" is sent to the CLIP text encoder and converted into a fixed class embedding.
  2. Yes. The last channel corresponds to the ignored label (regions that should be ignored according to the dataset definition, or regions not matched to any object); see the sketch after this list.
  3. The class labels are defined by the segmentation ground truth together with the registry you define. The segmentation ground truth is an image of class ids, and each id is mapped to a category name by the registry. For example, if you define the class names as an ordered list, then id 0 in the ground truth is mapped to the first item in that list, as illustrated below.
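
To illustrate 2., a small sketch of the behaviour being described, assuming mask_cls carries the 5 registered classes plus one trailing "no object"/ignore channel (the shapes here are made up):

```python
import torch
import torch.nn.functional as F

num_queries, num_classes = 100, 5                     # 5 registered classes (background + 4 foreground)
mask_cls = torch.randn(num_queries, num_classes + 1)  # extra last channel = "no object" / ignored

# Softmax over all num_classes + 1 channels, then drop the trailing channel so that
# only the registered classes contribute to the final per-query class scores.
probs = F.softmax(mask_cls, dim=-1)[..., :-1]         # shape: (num_queries, num_classes)
```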
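
And a toy illustration of the id-to-name mapping in 3., using the same placeholder class names as in the registration sketch above (the pixel values are made up):

```python
CLASS_NAMES = ("background", "class_a", "class_b", "class_c", "class_d")

# A semantic segmentation ground truth stores one class id per pixel;
# 255 marks regions that are ignored during training and evaluation.
for pixel_value in (0, 2, 255):
    label = "ignored" if pixel_value == 255 else CLASS_NAMES[pixel_value]
    print(pixel_value, "->", label)  # 0 -> background, 2 -> class_b, 255 -> ignored
```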