Hi,
Thank you for this great open-source library. I am currently trying to use its PPO implementation together with Isaac Gym.
More specifically, I am looking for a way to update the action only once per episode: one action from the action buffer should be sampled at the beginning of the episode and then remain constant until the episode ends. Is there a way to do this?
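
To make the intended behaviour concrete, here is a minimal sketch of what I mean, not something taken from the library. `EpisodeActionLatch` is a hypothetical helper name, and NumPy arrays stand in for the batched tensors a vectorized simulator would return. The policy can still propose an action at every step, but the proposal is only accepted for environments that have just started a new episode.

```python
import numpy as np


class EpisodeActionLatch:
    """Hypothetical helper: holds one action per environment for a whole episode."""

    def __init__(self, num_envs, action_dim):
        self.held = np.zeros((num_envs, action_dim), dtype=np.float32)
        # Every environment needs a fresh action at its very first step.
        self.needs_new = np.ones(num_envs, dtype=bool)

    def select(self, proposed):
        """Accept `proposed` only for envs that just started an episode."""
        proposed = np.asarray(proposed, dtype=np.float32)
        self.held[self.needs_new] = proposed[self.needs_new]
        self.needs_new[:] = False
        return self.held.copy()

    def mark_done(self, dones):
        """Flag envs whose episode ended so they latch a new action next step."""
        self.needs_new |= np.asarray(dones, dtype=bool)


# Usage with two environments and a 1-D action.
latch = EpisodeActionLatch(num_envs=2, action_dim=1)
a0 = latch.select([[1.0], [2.0]])   # both envs latch their first action
latch.mark_done([False, True])      # only env 1 finished its episode
a1 = latch.select([[5.0], [6.0]])   # env 0 keeps 1.0, env 1 takes 6.0
```

What I do not know is where something like this would hook into the PPO training loop, and how the repeated steps should be stored for the update.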