[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
Dear author,
What is the purpose of the SparK.forward on line 156 of main.py in pretrain? Is it necessary to call the forward function there? I believe the model actually defined is model_without_ddp.
There is a similar case in pretrain/viz_reconstruction, where the function show makes another spark.forward call.
Looking forward to your response!