Open brettkoonce opened 4 years ago
Thanks for opening up the issue alongside with repro case! I just want to add my 2 cents that I initially discovered this problem with AMSGrad
which is not working as well.
ugh you're right i copy pasted the wrong call, fixed!
all credit to @s1ddok for reporting this in https://github.com/tensorflow/swift-models/pull/394!
something is failing (i think block-based reduces) with the adamax/AMSGrad optimizers:
[edit: updated 2020-05-17 to match latest swift-models api changes, still failing against master]
full trace: