Open jameslahm opened 5 months ago
I completed this work during my internship at Alibaba. When I left, I took only part of the code, covering classification and object detection. The semantic segmentation results were not good, so we reported them only in the supplementary material. We tuned many parameters, but unfortunately I no longer remember them. I am sorry about that.
Thanks for your great work! I am trying to use STViT-R-Swin-S for semantic segmentation, following Sec. 6.3 of the paper. I use the pretrained STViT-R-Swin-S checkpoint from #5, which has a Top-1 accuracy of 82.43%. I use the code from https://github.com/SwinTransformer/Swin-Transformer-Semantic-Segmentation and replace the `configs/swin/upernet_swin_small_patch4_window7_512x512_160k_ade20k.py` file with the following:
I copy the backbone file from https://github.com/changsn/STViT-R-Object-Detection/blob/main/mmdet/models/backbones/swin_transformer.py and change only line 18 to `from mmseg.utils import get_root_logger`. However, I obtain only 46.36 mIoU with `--aug-test`, which is about 1.9 points below the 48.3 mIoU reported in Table 12 of the paper. Could you please give me some guidance on how to reproduce the result correctly? Thanks a lot; I would appreciate it very much.
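One thing that may be worth ruling out when a backbone file is copied between codebases (an assumption on my side, not something confirmed in this thread) is that some pretrained weights are silently skipped because parameter names do not match. Below is a minimal sketch of a key comparison in plain Python. The helper `diff_state_dict_keys`, the `backbone.` prefix, and all parameter names in the example are hypothetical; in practice the two lists would come from the checkpoint's state dict and from `model.state_dict().keys()`.

```python
def diff_state_dict_keys(checkpoint_keys, model_keys, strip_prefix="backbone."):
    """Compare checkpoint parameter names with the names the model expects.

    Hypothetical helper: `strip_prefix` is removed from any key that starts
    with it, so that "backbone.x" and "x" are treated as the same parameter.
    """
    def normalize(key):
        if strip_prefix and key.startswith(strip_prefix):
            return key[len(strip_prefix):]
        return key

    ckpt = {normalize(k) for k in checkpoint_keys}
    model = {normalize(k) for k in model_keys}
    return {
        # Parameters the model has but the checkpoint does not provide.
        "missing_in_checkpoint": sorted(model - ckpt),
        # Parameters in the checkpoint that the model has no slot for.
        "unexpected_in_checkpoint": sorted(ckpt - model),
    }


# Illustrative key names only; they are not taken from the real checkpoint.
checkpoint_keys = [
    "backbone.patch_embed.proj.weight",
    "backbone.layers.0.blocks.0.attn.qkv.weight",
    "head.weight",
]
model_keys = [
    "patch_embed.proj.weight",
    "layers.0.blocks.0.attn.qkv.weight",
    "norm3.weight",
]

report = diff_state_dict_keys(checkpoint_keys, model_keys)
print(report)
```

If `missing_in_checkpoint` lists backbone layers, those layers would start from random initialization, which could explain part of a gap. A classification head showing up under `unexpected_in_checkpoint` is normally harmless for segmentation.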