google-research / episodic-curiosity

TensorFlow/Keras code and trained models for Episodic Curiosity Through Reachability
Apache License 2.0

a little question about the code #1

Closed chaobiubiu closed 5 years ago

chaobiubiu commented 5 years ago

In curiosity_env_wrapper.py, the step_wait function returns postprocessed_rewards. However, I see that postprocessed_rewards = (self._scale_task_reward * rewards + scale_surrogate_reward * bonus_rewards), where scale_surrogate_reward is set to 0. I want to know how the episodic curiosity reward is passed to the PPO training process. I have probably missed something in reading the code, but I can't solve this problem myself. I sincerely hope you can help me.

RaphaelMarinier commented 5 years ago

_scale_surrogate_reward and _scale_task_reward of CuriosityEnvWrapper are passed as gin-config parameters. If you are not familiar with gin, you can read about it there.

In our code, gin parameters are passed in launcher_script.py. See for instance this line.

(Closing the issue, since this should answer your question)
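For readers following along, here is a minimal sketch of the reward combination discussed above. It is not the repository's code: RewardMixer is a hypothetical stand-in for the wrapper, the default values are assumptions chosen to match the behaviour described in the question, and the scales are passed as plain constructor arguments where the repository supplies them through gin bindings.

```python
from typing import List, Sequence


class RewardMixer:
    """Hypothetical stand-in for the wrapper's reward post-processing."""

    def __init__(self,
                 scale_task_reward: float = 1.0,        # assumed default
                 scale_surrogate_reward: float = 0.0):  # assumed default
        self._scale_task_reward = scale_task_reward
        self._scale_surrogate_reward = scale_surrogate_reward

    def postprocess(self,
                    rewards: Sequence[float],
                    bonus_rewards: Sequence[float]) -> List[float]:
        # Weighted sum of the task reward and the curiosity bonus,
        # computed per environment.
        return [
            self._scale_task_reward * r + self._scale_surrogate_reward * b
            for r, b in zip(rewards, bonus_rewards)
        ]


rewards = [1.0, 0.0]        # task rewards from two environments
bonus_rewards = [0.5, 2.0]  # curiosity bonuses for the same steps

# With a surrogate scale of 0, the bonus never reaches the learner.
default_mixer = RewardMixer()
print(default_mixer.postprocess(rewards, bonus_rewards))  # [1.0, 0.0]

# Overriding the scale (done via gin in the repository) mixes the bonus in.
curious_mixer = RewardMixer(scale_task_reward=1.0, scale_surrogate_reward=0.5)
print(curious_mixer.postprocess(rewards, bonus_rewards))  # [1.25, 1.0]
```

The point of the sketch is that a value of 0 seen in the constructor is only a default; whatever value the configuration supplies at launch time is what scales the bonus in the returned rewards.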

chaobiubiu commented 5 years ago

Thank you.