Do normalized rewards work with other agents for Atari?

openai / large-scale-curiosity

Code for the paper "Large-Scale Study of Curiosity-Driven Learning"


Open zafarmah92 opened 5 years ago

zafarmah92 commented 5 years ago

I would like to use the normalized reward (#6) with other agents, for example A2C, where the discounted returns are computed from the extrinsic reward.

First, to what extent do the intrinsic rewards need to be normalized, given that the forward loss (#3) tends to decrease over time?
Second, does scaling the rewards also require advantage normalization, from the perspective of agents such as A2C, ACKTR, etc.?
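
To make the two operations in question concrete, here is a minimal NumPy sketch of what I mean. It is only an illustration under my own assumptions, not the repository's code: the names `RunningStd`, `normalize_rewards` and `normalize_advantages` are hypothetical, rewards are divided by a running estimate of the standard deviation of the discounted reward sum, and the discounted sum is reset on every call for simplicity.

```python
import numpy as np


class RunningStd:
    """Running mean/variance tracker (hypothetical helper), merged batch by batch."""

    def __init__(self, eps=1e-4):
        self.mean, self.var, self.count = 0.0, 1.0, eps

    def update(self, x):
        x = np.asarray(x, dtype=np.float64).ravel()
        b_mean, b_var, b_count = x.mean(), x.var(), x.size
        delta = b_mean - self.mean
        total = self.count + b_count
        # Combine the old and new second moments (parallel variance update).
        m2 = (self.var * self.count + b_var * b_count
              + delta ** 2 * self.count * b_count / total)
        self.mean = self.mean + delta * b_count / total
        self.var = m2 / total
        self.count = total


def normalize_rewards(rewards, stats, gamma=0.99):
    """Scale rewards of shape [T, n_envs] by the running std of their discounted sum.

    Assumption: the discounted sum is accumulated forward in time and
    restarted at zero on each call.
    """
    rewards = np.asarray(rewards, dtype=np.float64)
    running = np.zeros(rewards.shape[1])
    disc_sums = np.empty_like(rewards)
    for t in range(rewards.shape[0]):
        running = running * gamma + rewards[t]
        disc_sums[t] = running
    stats.update(disc_sums)
    # Only divide; the mean is not subtracted, so the reward sign is preserved.
    return rewards / np.sqrt(stats.var + 1e-8)


def normalize_advantages(adv, eps=1e-8):
    """Standardize a batch of advantages to zero mean and unit variance."""
    adv = np.asarray(adv, dtype=np.float64)
    return (adv - adv.mean()) / (adv.std() + eps)
```

With this kind of scaling, the normalized reward should be roughly insensitive to the raw magnitude of the intrinsic reward, which is why I am unsure whether advantage normalization is still needed on top of it.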