Open axsk opened 1 year ago
The problem seems to be with sum, not with the derivative wrt. x. Using
sum(abs2, sc(x, p_nn))
instead of the simple sum
works.
However, switching the gradient to be wrt. the parameters leads to the same StackOverflowError:
Zygote.gradient(p_nn) do p
    sum(sc(rand(2), p))
end
The solution here will be to add a few methods that support Fill arrays.
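For context, here is a rough sketch of why a plain sum might hit a different code path than sum(abs2, ...). It assumes (this is not confirmed in the thread) that the cotangent passed back from sum is a lazy FillArrays.Fill, while the one from sum(abs2, ...) is an ordinary dense vector. The variable y is a hypothetical stand-in for the network output sc(x, p); nothing below calls SimpleChains or Zygote.

```julia
using FillArrays

# Hypothetical stand-in for the network output sc(x, p).
y = [0.5, -1.5]

# d/dy sum(y): every entry is 1, so a lazy constant array is enough.
# Assumption: the AD rule hands back something like this Fill.
g_sum = Fill(1.0, size(y))

# d/dy sum(abs2, y) = 2y: depends on y, so it is a dense Vector.
g_abs2 = 2 .* y

# A Fill is an AbstractVector but not an Array, so methods written
# only for Array (or dense/strided arrays) will not match it.
println(g_sum isa AbstractVector{Float64})
println(g_sum isa Array)
println(g_abs2 isa Array)
```

If that assumption holds, supporting Fill means adding methods that accept this array type (or converting it with collect) wherever the backward pass currently expects a dense array.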
returns
using Julia 1.8.1 and SimpleChains 0.3.1