-
Hello @amirgholami ,
I'm super excited about this optimizer. Thank you!
I want to use it in a NER task using AllenNLP. But I'm confused because the code differs between the image_classification …
-
Hi,
Thank you for the implementation. Unfortunately, I am having a problem with one variable that is not defined, and I'm not sure what it is supposed to be:
File "adahessian.py", line 77, in set_hess…
-
Apologies if I have this wrong, but is there code for the language modelling experiments? I think that /transformer only contains the NMT experiments. Thanks.
-
Hello,
I'm a little confused about your experimental settings on ImageNet. Could you please clarify the following points?
1/ The initial learning rate is set to 0.15. That is to say, weight de…
-
Hi,
I tried the "Advanced usage" example code you showed in your README to accumulate multiple Hessians. I did this by calling set_hessian() multiple times for the same input to get a better estima…