Currently the policy iteration approach loops over each x-value and optimises the control only at that position.
Is it possible (and faster) to run a larger optimisation over the controls on all x-values instead? Maybe by summing together hamiltonian(i,j) for all the indices i (and j in 2D)?
Currently the policy iteration approach loops over each x-value and optimises the control only at that position. Is it possible (and faster) to run a larger optimisation over the controls on all x-values instead? Maybe by summing together hamiltonian(i,j) for all the indices i (and j in 2D)?