question about training recipe

THU-MIG / RepViT

RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything

Apache License 2.0

756 stars 56 forks source link

Hi,

Thanks for bring the work to public !! I have a question about experiments in Table 5.

In the paper, it is claimed that the training method of repVIT is identical to mobilenet-v3L, which consists of many modern training tricks. I believe the model used in Table 5 is also trained with these. It shows that resnet18 is a bit faster than repVIT-M1.1, but its down-stream task performance is much worse. Does the resnet18 model used here is also trained with mobilenet-v3L recipe, or it is only the original resnet18 model trained for 100 epoch without other tricks?

THU-MIG / RepViT

question about training recipe #38