I also found that the jvp is a scalar, and I'm not sure how it is calculated. I would like a formula that shows the computation flow layer by layer. Do you know how to derive it?
Hi - Great work!
I have one question:
_loss, jvp = fc.jvp(f, (tuple(params),), (vparams,))
Do you know why jvp is a scalar? I would have thought that this is a matrix. Also, is there a reason why we are calling tuple(params) instead of params?
Thank you.
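For anyone landing here with the same question, here is a minimal sketch of why the result is a scalar, assuming f maps the parameters to a scalar loss. In that case the JVP is the directional derivative of the loss along vparams, i.e. the dot product of the gradient with the tangent, which is a single number and not a matrix. The sketch uses a toy dual-number implementation in plain Python, not functorch itself; the names `Dual`, `tanh`, `jvp`, and `loss`, the two-layer network, and the fixed input 0.5 are all made up for illustration. On the tuple question: as far as I understand, functorch's jvp expects primals and tangents as tuples with matching structure, so `tuple(params)` is presumably there to match the structure of vparams, but that is an assumption about the surrounding code.

```python
import math

class Dual:
    """A value paired with its tangent (directional derivative)."""
    def __init__(self, val, tan=0.0):
        self.val = val
        self.tan = tan

    def __add__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        return Dual(self.val + other.val, self.tan + other.tan)
    __radd__ = __add__

    def __mul__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        # Product rule: (a * b)' = a' * b + a * b'
        return Dual(self.val * other.val,
                    self.tan * other.val + self.val * other.tan)
    __rmul__ = __mul__

def tanh(x):
    # Chain rule: tanh(u)' = (1 - tanh(u)^2) * u'
    t = math.tanh(x.val)
    return Dual(t, (1.0 - t * t) * x.tan)

def jvp(f, primals, tangents):
    """Toy forward-mode JVP: returns (f(primals), directional derivative)."""
    duals = tuple(Dual(p, t) for p, t in zip(primals, tangents))
    out = f(duals)
    return out.val, out.tan

def loss(params):
    # Hypothetical two-layer network with one weight per layer, input x = 0.5.
    w1, w2 = params
    h = tanh(w1 * 0.5)  # layer 1: tangent is (1 - h^2) * 0.5 * v1
    y = w2 * h          # layer 2: tangent is v2 * h + w2 * h'
    return y * y        # loss:    tangent is 2 * y * y'

value, directional = jvp(loss, (1.0, 2.0), (0.3, -0.7))
print(value, directional)  # two plain floats
```

The tangent is pushed forward through each layer alongside the value, so the layer-by-layer formula is just the chain rule applied in evaluation order. Because the final output is a scalar, the final tangent is a scalar too.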