Closed blancamiller closed 3 months ago
Hi @blancamiller, thanks for using Brax! Could you describe your use case in a bit more detail? What do you mean by parameter values for each time-step? Parameters don't change during a rollout; they are updated over a batch of episodes.
I am using the provided Brax Training tutorial for PPO + Ant. The notebook provides a user-defined callback function `progress`. Similarly, you can specify a `policy_param` function and pass it into `train`. This is where my issue lies. I'm able to get the parameter values for each episode; however, I'd like to access the parameters for each time-step. How can I go about doing that?
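To make the question concrete, here is a minimal sketch of the callback pattern in plain Python. It does not use Brax: the `train` function below is a hypothetical stand-in, and its arguments (`num_updates`, `steps_per_update`, `policy_param_fn`) and the toy update rule are assumptions made for illustration only. It records one parameter snapshot per update and then looks up which snapshot was in effect at a given time-step, on the assumption that parameters stay fixed within a rollout.

```python
import bisect
from typing import Callable, Dict, List, Tuple

Params = Dict[str, int]


def train(num_updates: int,
          steps_per_update: int,
          policy_param_fn: Callable[[int, Params], None]) -> Params:
    """Hypothetical stand-in for a training loop (NOT the Brax implementation)."""
    params: Params = {"w": 0}
    env_steps = 0
    # Report the initial parameters, which are in effect during the first rollout.
    policy_param_fn(env_steps, dict(params))
    for _ in range(num_updates):
        # Rollout: the parameters are only read here, never modified.
        for _ in range(steps_per_update):
            env_steps += 1
        # Toy update after the rollout (assumption: a real update would use gradients).
        params = {"w": params["w"] + 1}
        policy_param_fn(env_steps, dict(params))
    return params


# User-defined callback: store (env_step, params) each time it is called.
snapshots: List[Tuple[int, Params]] = []


def policy_param(step: int, params: Params) -> None:
    snapshots.append((step, params))


def params_at(t: int) -> Params:
    """Return the parameters that were in effect while taking time-step t (1-indexed)."""
    steps = [s for s, _ in snapshots]
    # The snapshot recorded strictly before step t is the one used to act at step t.
    idx = bisect.bisect_left(steps, t) - 1
    return snapshots[idx][1]


final_params = train(num_updates=2, steps_per_update=4, policy_param_fn=policy_param)
```

Under these assumptions, a per-time-step parameter history is just the per-update history repeated for every step of the corresponding rollout, so `params_at(t)` is enough to recover it without logging at every step.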