Alpha-VL / ConvMAE

ConvMAE: Masked Convolution Meets Masked Autoencoders
MIT License
477 stars 41 forks source link

Doubts about masking strategy #22

Open aichifandefan opened 1 year ago

aichifandefan commented 1 year ago

Hi! Thanks for the opensource code. I have the doubts about masking strategy. In the paper: Uniformly masking stage-1 input tokens from the H/4 × W/4 featuremaps would cause all tokens of stage-3 to have partially visible information and requires keeping all stage-3 tokens. Why the visible information will pass to the stage-3, if the images was masked in the first. Thanks very much!