learnables / learn2learn

A PyTorch Library for Meta-learning Research
http://learn2learn.net
MIT License

About the meaning of `allow_nograd` in `MAML.adapt()` #380

Closed · gwwo closed this 1 year ago

gwwo commented 1 year ago

Hi,

I found that the implementation of the `allow_nograd` argument seems to contradict its meaning.

https://github.com/learnables/learn2learn/blob/0b9d3a3d540646307ca5debf8ad9c79ffe975e1c/learn2learn/algorithms/maml.py#L109-L129

According to its name and docstring, I believe it's meant to let model parameters with `requires_grad=False` be fast-adapted as well, computing gradients w.r.t. them and later backpropagating through those gradients so they contribute to the meta-gradients w.r.t. the parameters with `requires_grad=True`.

However, in the implementation, the `if allow_nograd:` branch actually filters those no-grad parameters out, i.e., it sets `gradient = None` for them so that they are skipped in `maml_update`:

https://github.com/learnables/learn2learn/blob/0b9d3a3d540646307ca5debf8ad9c79ffe975e1c/learn2learn/algorithms/maml.py#L138-L166
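To make the behavior concrete, here is a minimal sketch of the logic I'm describing (a hypothetical standalone function, not the library's exact code): gradients are computed only for the differentiable parameters, and `None` is interleaved for the frozen ones, which the update step then skips.

```python
import torch
from torch.autograd import grad

def adapt_sketch(module, loss, second_order=True):
    # Differentiate only w.r.t. parameters that require gradients.
    diff_params = [p for p in module.parameters() if p.requires_grad]
    grad_params = grad(loss, diff_params,
                       retain_graph=second_order,
                       create_graph=second_order)
    # Re-align gradients with the full parameter list: frozen
    # parameters get None, so the update step leaves them untouched.
    gradients = []
    grad_counter = 0
    for param in module.parameters():
        if param.requires_grad:
            gradients.append(grad_params[grad_counter])
            grad_counter += 1
        else:
            gradients.append(None)
    # maml_update-style step: skip parameters whose gradient is None.
    return gradients
```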

seba-1511 commented 1 year ago

Hello @gwwo,

The `allow_nograd` argument supports fast-adapting models that have non-differentiable parameters. Those parameters are never adapted (otherwise you could just give them `requires_grad = True` and set their meta-learning rate to 0), but they need a little extra care. The flag provides that care, and it is `False` by default because we want to be explicit about when these parameters are expected in the graph and when they are not.
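A quick usage sketch of that scenario (toy model and data of my own, assuming the standard `MAML` wrapper): the first layer is frozen, so its parameters sit in the graph but are never adapted, and `allow_nograd=True` tells `adapt()` to differentiate only w.r.t. the trainable ones.

```python
import torch
import learn2learn as l2l

model = torch.nn.Sequential(
    torch.nn.Linear(4, 8),
    torch.nn.ReLU(),
    torch.nn.Linear(8, 1),
)
# Freeze the first layer: its parameters should never be adapted.
for p in model[0].parameters():
    p.requires_grad = False

maml = l2l.algorithms.MAML(model, lr=0.1)
learner = maml.clone()

x, y = torch.randn(16, 4), torch.randn(16, 1)
loss = torch.nn.functional.mse_loss(learner(x), y)

# Without allow_nograd=True, autograd would be asked to
# differentiate w.r.t. the frozen parameters as well and raise.
learner.adapt(loss, allow_nograd=True)
```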

Hope this helps!