Closed TranThanh96 closed 5 months ago
Of course! We will upload the corresponding configuration files in the next few days.
Hi~ @TranThanh96 We have uploaded the relevant configuration file.
@DanJun6737 can you provide your training log? my loss is a little bit high
Hi @TranThanh96 ~ Sorry, some training logs have been lost. Both the DPAP strategy and the EHSM strategy can cause the loss of the model to be higher in the early stages of training compared to a normal ViT model, but it will quickly decrease over time. Additionally, our configuration files are based on 8 V100 GPUs, so if a different number of GPUs is used, the corresponding configuration parameters need to be adjusted accordingly.
Can you provide VitL config to train glint360k?