Closed mmterkc closed 3 years ago
I trained rl model but i get different reward values same env and same steps and same model. Is it normal ?
my model is ACKTR
See the docs on general tips on RL: Results vary wildly between runs. You have to set seeds correctly (see this section).
I trained rl model but i get different reward values same env and same steps and same model. Is it normal ?