Closed surya-chak closed 3 years ago
It would be good to get a stacktrace by terminating the run. Also, could you please format the code snippets in the post?
I would also try to run the different training steps by hand (that is, call Flux.gradient
and update
manually)
no stacktrace, no MWE, and OP not responsive, closing this
Flux.train! goes through the first iteration resulting in 1 call to the callback function after which the process seems to get stuck. We've used htop to verify that the Julia process stops using any CPU at all after a few minutes into this step.
We're are currently using Flux.train! for a NeuralODE problem, the code for which is attached below: