Open OValery16 opened 4 years ago
Hi @OValery16 , Thanks for your interests. I'm not sure if I understand your question correctly, but I'll try to explain the lines you see.
The lines following "Long cycle index Base shape Epochs" describe the "long cycle" schedule when "long cycle" is enabled. If you also enable "short cycles", it doesn't change what's printed in those lines (because otherwise it might get harder to read), but internally short cycles are used. You may print out the shape of tensors obtained from the data loader to verify.
Thank you for releasing such very useful tool.
After reading "A Multigrid Method for Efficiently Training Video Models", I am a bit confused about the multigrid scheduling policy. In the paper, it is mentioned 4 approaches: baseline, long cycles, short cycles, long +short cycles (default setting)
However, after displaying the multigrid policy used in this repo, I get:
In the paper, it is stated that: