To reduce the run time it would be nice if the gradient descending would run parallel on all cores - or the amount you give the script. So first one run without parallelisation to get the direction of the gradient. After that parallel stepwise on all given cores.
To reduce the run time it would be nice if the gradient descending would run parallel on all cores - or the amount you give the script. So first one run without parallelisation to get the direction of the gradient. After that parallel stepwise on all given cores.