sithu31296 / semantic-segmentation

SOTA Semantic Segmentation Models in PyTorch
MIT License
830 stars 149 forks source link

Double Checking implementation detail in SegFormerHead #29

Closed RahulSinghalChicago closed 2 years ago

RahulSinghalChicago commented 2 years ago

The paper on SegFormer suggests an All MLP decoder.

Screen Shot 2022-04-23 at 3 03 57 AM

The SegformerHead.py shows the use of a Conv2D for the final layer.

Screen Shot 2022-04-23 at 3 06 18 AM

Can you help me understand if this is a deviation from the paper or mentioned in a followup paper somewhere? I apologize in advance if there is an obvious answer.

sithu31296 commented 2 years ago

1x1 conv is basically the same as linear operation. In segmentation, 1x1 conv is usually used as the last layer. You can also check the official segformer implementation.

RahulSinghalChicago commented 2 years ago

Thank you!