NUS-HPC-AI-Lab / InfoBatch

Lossless Training Speed Up by Unbiased Dynamic Data Pruning

Some questions about infobatch at epoch0 #25

Closed CKK-coder closed 2 months ago

CKK-coder commented 4 months ago

Hi, thank you for your great work! I applied your InfoBatch to my own model in the way you provided. There are a few observations I would like to confirm with you.

  1. At epoch 0, the model's loss became twice what it is without InfoBatch, which I believe is abnormal. I investigated and found that this line of code evaluated to true. However, since all scores are initialized to 3.0, they should all equal their mean, and well_learned_mask should be False at epoch 0. I want to know whether my understanding is correct, or whether you intentionally designed it this way (double loss at epoch 0).
  2. What should I do about "if the learning rate scheduler is epoch-based, adjust its steps accordingly at the beginning of each epoch"? For example, my model's learning rate scheduler reduces the learning rate by a factor of 0.1 at epochs 10, 20, and 25. What should I do if I apply InfoBatch to my model?
henryqin1997 commented 3 months ago

Thank you for the question.

  1. At epoch 0, the check should evaluate to false, so that all samples are seen in the first epoch without any dropping or rescaling. The scores are initialized to 3.0 because, with one numpy version, an all-zero initialization caused the check to evaluate to all true. Adapt this logic to your code so that every sample is seen in the first epoch without being dropped or rescaled (see the mask sketch below).
  2. Sorry for the confusion. That note applies to step-based schedulers, since the number of steps per epoch changes from epoch to epoch under pruning. If you don't call the learning rate scheduler per step, you don't need to change anything; an epoch-based schedule like yours can stay as-is (see the scheduler sketch below). Will update the readme.
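
For item 1, here is a minimal mask sketch (not the repository's exact code) of why scores that all equal their mean should yield an all-False well_learned_mask at epoch 0:

```python
import numpy as np

# Sketch of the pruning check discussed above; `scores` stands for the
# per-sample loss values InfoBatch keeps between epochs.
num_samples = 8
scores = np.full(num_samples, 3.0)         # initial value, as in the issue

# With a strict "<" comparison, scores that all equal their mean produce an
# all-False mask, so no sample is dropped or rescaled in the first epoch.
well_learned_mask = scores < scores.mean()
print(well_learned_mask.any())             # expected: False at epoch 0
```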
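
For item 2, a minimal scheduler sketch with an epoch-based schedule like the questioner's (milestones at epochs 10/20/25, factor 0.1), kept unchanged. The training loop here is a placeholder, not InfoBatch's API; the point is that a scheduler stepped once per epoch is unaffected by how many batches survive pruning in each epoch:

```python
import torch
from torch import nn, optim

model = nn.Linear(10, 2)
optimizer = optim.SGD(model.parameters(), lr=0.1)
scheduler = optim.lr_scheduler.MultiStepLR(optimizer, milestones=[10, 20, 25], gamma=0.1)

for epoch in range(30):
    num_batches = 100                      # with InfoBatch this may shrink per epoch
    for _ in range(num_batches):
        optimizer.zero_grad()
        loss = model(torch.randn(4, 10)).sum()   # placeholder training step
        loss.backward()
        optimizer.step()
    scheduler.step()                       # epoch-based: one call per epoch,
                                           # independent of the batch count
print(optimizer.param_groups[0]["lr"])     # 1e-4 after milestones 10, 20, 25
```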