Closed chaobiubiu closed 5 years ago
_scale_surrogate_reward and _scale_task_reward of CuriosityEnvWrapper are passed as gin-config parameters. If you are not familiar with gin, you can read about it there.
In our code, gin parameters are passed in launcher_script.py. See for instance this line.
(Closing the issue, since this should answer your question)
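As a rough illustration of what these two parameters control, here is a minimal sketch of a weighted reward mix. It is not the repository's code: the function name combine_rewards and the default values are assumptions made for the example, and the real wrapper receives its scales through gin rather than through keyword arguments.

```python
from typing import List, Sequence


def combine_rewards(
    task_rewards: Sequence[float],
    bonus_rewards: Sequence[float],
    scale_task_reward: float = 1.0,       # assumed default, for illustration only
    scale_surrogate_reward: float = 0.0,  # zero means the bonus is ignored
) -> List[float]:
    """Return the per-environment weighted sum of task and bonus rewards."""
    return [
        scale_task_reward * r + scale_surrogate_reward * b
        for r, b in zip(task_rewards, bonus_rewards)
    ]


task = [1.0, 0.0]   # rewards coming from the environment
bonus = [0.5, 2.0]  # hypothetical curiosity bonuses

# With a zero surrogate scale, only the task reward reaches the learner.
default_mix = combine_rewards(task, bonus)

# A non-zero surrogate scale adds the bonus to what the learner sees.
with_bonus = combine_rewards(task, bonus, scale_surrogate_reward=0.5)
```

Under this sketch, the curiosity bonus only reaches training when the surrogate scale is non-zero, which is why the value supplied through the gin configuration matters.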
Thank you.
In curiosity_env_wrapper.py, the step_wait function returns postprocessed_rewards. However, I see that postprocessed_rewards = (self._scale_task_reward * rewards + self._scale_surrogate_reward * bonus_rewards), where scale_surrogate_reward is set to 0. I want to know how to pass the episodic curiosity reward to the PPO training process. I have probably missed something in reading the code, but I can't solve this problem myself. I sincerely hope you can help me.