Open albertma-evotec opened 4 years ago
Yeah, you are right cond_lb
and cond_rb
are not used. Still don't know what is the purpose.
lr_lp
refers to the learning rate of Learnable Prior and lr_dec
refers to the learning rate of Decoder
As in the reinforcement learning step, only the lp
and dec
are trained so optimizer_lp
and optimizer_dec
are used to update the parameters
in gentrl.py, the parameter cond_lb and cond_rb was not used in the body? What is the purpose of them? and why there are two learning-rate variable (lr_lp and lr_dec)? is lr_dec referring to the decay rate?
Thanks