amzn / metalearn-leap

Original PyTorch implementation of the Leap meta-learner (https://arxiv.org/abs/1812.01054) along with code for running the Omniglot experiment presented in the paper.

Leap usage #5

Open semin-park opened 4 years ago

semin-park commented 4 years ago

In your Leap usage example, leap.update() is called inside the for loop. However, shouldn't it also be called right after the for loop? Since leap.update() does nothing but register _prev_state and _prev_loss on the first call, if the for loop runs five times, calling leap.update() five times only accumulates the first four segments of the gradient path, missing the last (fifth) gradient step.

I think the final loss (at $\theta_K$) should be computed, and leap.update() called, one more time after the for loop, roughly as in the sketch below. What do you guys think?

https://github.com/amzn/metalearn-leap/blob/1436ee3029bf5de7cd8e317b9bbf56ff02f46a6c/src/leap/leap/leap.py#L36-L60
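For concreteness, this is roughly the pattern I have in mind. Everything below (model, criterion, opt, the task batches) is a placeholder sketch rather than the actual example code, and leap stands for an already constructed Leap instance; the only call taken from the example is leap.update(loss, model):

import torch
from torch import nn

# Placeholder task model, loss, and optimizer for illustration only.
model = nn.Linear(10, 2)
criterion = nn.CrossEntropyLoss()
opt = torch.optim.SGD(model.parameters(), lr=0.1)

# Dummy data: K = 5 adaptation batches plus one extra batch for the final evaluation.
task_batches = [(torch.randn(8, 10), torch.randint(0, 2, (8,))) for _ in range(5)]
final_x, final_y = torch.randn(8, 10), torch.randint(0, 2, (8,))

# `leap` is assumed to be a constructed Leap instance.
for x, y in task_batches:
    loss = criterion(model(x), y)
    leap.update(loss, model)   # first call only registers _prev_state / _prev_loss
    opt.zero_grad()
    loss.backward()
    opt.step()

# Proposed addition: evaluate the loss at theta_K and register the final step as well.
final_loss = criterion(model(final_x), final_y)
leap.update(final_loss, model)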

flennerhag commented 4 years ago

Hey, that's a great question!

You can definitely do what you propose. Essentially, the difference between that and the snippet above is that we sneak in an extra gradient step on the final loss, since it's already paid for :-)

Here's how they compare:

# K inner-loop adaptation steps
for i in range(K):
    # ... compute loss, call leap.update(loss, model), then loss.backward() and optimizer.step()

# what you propose: also register the final (K-th) step
final_loss = criterion(model(final_x), final_y)
leap.update(final_loss, model)

# what we do: additionally take an extra gradient step on the final loss, since it's already computed
final_loss.backward()
optimizer.step()

In the end, it won't matter unless K is very low: with, say, K = 100 inner steps only one of the 100 segments of the gradient path goes untracked, whereas with K = 5 you'd be dropping a fifth of the path.