stan-dev / stan

Stan development repository. The master branch contains the current release. The develop branch contains the latest stable development. See the Developer Process Wiki for details.
https://mc-stan.org
BSD 3-Clause "New" or "Revised" License
2.57k stars 368 forks source link

Exposing mu to the services (stepsize warmup) #2670

Open Tuxonomics opened 5 years ago

Tuxonomics commented 5 years ago

Summary:

Regarding the parameters of the dual averaging optimization for the stepsize in the warmup, all parameters can be set by the user except mu. For certain models, the initial choice of mu can result in a drastic drop of the stepsize for subsequent iterations. This might lead to extra computation time. As mu is adjusted for each window of the mass matrix adjustment, this can in effect add a significant amount of computation time.

Description:

Currently, mu is set to be log(10 * this->nom_epsilon_) which is hardcoded in the algorithm, and the services stan::services::sample::hmc_static_dense_e_adapt, stan::services::sample::hmc_static_diag_e_adapt, stan::services::sample::hmc_nuts_dense_e_adapt and stan::services::sample::hmc_nuts_diag_e_adapt do not expose the factor of epsilon to the user. This issue would entail

There are also the equivalent algorithms in nuts_classic, static_uniform and xhmc.

The default factor of 10 should be kept.

Additional Information:

Issues in Rstan, Pystan and CmdStan would have to be opened as well.

Current Version:

v2.18.0

bob-carpenter commented 5 years ago

Thanks for submitting this and for explaining it so well. Adding new arguments is a lot easier than removing them. I'm adding the "good first issue" tag not so much because this is simple, but because this issue describes what needs to be done so clearly and it's relatively modular and can be done by following the other parameters being passed.

avehtari commented 1 year ago

For the record, I'm adding links to two relevant discussion threads