kvfrans closed 8 years ago
Try changing max_kl, then.
Someone wrote in a previous post on GitHub that the current code doesn't do a line search. Adding one could help as well.
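For reference, here is a minimal sketch of what a backtracking line search for this kind of update could look like. It is not the code from this repo: `surrogate_loss`, `kl_divergence`, `theta_old`, and `full_step` are hypothetical names, and the shrink factor and number of backtracks are assumed defaults. The idea is to shrink the proposed step until the loss improves and the KL stays under max_kl, instead of rejecting the whole update when the full step violates the constraint.

```python
import numpy as np

def backtracking_line_search(surrogate_loss, kl_divergence, theta_old, full_step,
                             max_kl=0.01, max_backtracks=10, shrink=0.5):
    """Shrink the proposed step until the loss improves and KL <= max_kl.

    surrogate_loss(theta) -> float and kl_divergence(theta_old, theta_new) -> float
    are hypothetical callables supplied by the caller (assumptions, not repo code).
    Returns (theta, accepted).
    """
    loss_old = surrogate_loss(theta_old)
    for i in range(max_backtracks):
        # Try the full step first (i = 0), then 1/2, 1/4, ... of it.
        step_fraction = shrink ** i
        theta_new = theta_old + step_fraction * full_step
        improved = surrogate_loss(theta_new) < loss_old
        within_trust_region = kl_divergence(theta_old, theta_new) <= max_kl
        if improved and within_trust_region:
            return theta_new, True
    # No acceptable step was found: keep the old parameters.
    return theta_old, False
```

With something like this, a step that overshoots max_kl gets scaled down rather than thrown away, so the policy can still change a little on each iteration.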
Hi, you may find https://github.com/wojzaremba/trpo/pull/3 helpful.
Merged.
I'm trying to reproduce results on Copy-v0.
The if statement here always triggers: the KL divergence between the old and new policies is always greater than 0.01 (max_kl), so no changes are ever made to the policy.
I am running the script with:
python main.py Copy-v0