Open BingyuanW opened 2 years ago
Thanks for the great work!
I wonder why the in_channels of decode_head is [224, 368, 480, 480] rather than [128, 224, 368, 480] for MPViT-Base?
Looking forward to your reply. Thanks again.
@BingyuanW Hi,
As mentioned in our paper, each stage outputs feature maps whose channel count equals the embedding channel size of the next stage.
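To make the channel arithmetic concrete, here is a minimal sketch of how the two lists in the question relate. The helper name `decode_head_in_channels` is hypothetical and not part of the MPViT code base. The rule that the last stage keeps its own width (since it has no next stage) is an assumption inferred from the values quoted in the question, not something confirmed by the repository.

```python
def decode_head_in_channels(embed_dims):
    """Return the per-stage output channels seen by the decode head.

    Hypothetical helper: stage i is assumed to emit embed_dims[i + 1]
    channels, and the last stage is assumed to keep its own width.
    """
    dims = list(embed_dims)
    # Shift the list by one stage, then repeat the final width.
    return dims[1:] + dims[-1:]


# Embedding channel sizes quoted in the question for MPViT-Base.
base_embed_dims = [128, 224, 368, 480]
print(decode_head_in_channels(base_embed_dims))  # [224, 368, 480, 480]
```

Under these assumptions the shifted list matches the in_channels from the config, which would explain why 128 never appears in the decode head.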