Open nicholas0717 opened 1 year ago
Hi nicholas, I have also tried the code on my task. It turns out that the results are unsatisfying. Did you find a way out?
Hi nicholas, I have also tried the code on my task. It turns out that the results are unsatisfying. Did you find a way out?
so do i, Did you find a way out?
Hi @toshikwa :
I'm trying to implement other tasks with your GAIL framework, but it works badly. I think the network units and relevant parameters I chose are not appropriate. I have some questions.
I want to know how to choose the
rollout_length
for different tasks. You chose 2000 forInvertedPendulum-v2
and 50000 forHopper-v3
. Is there a criterion for choosingrollout_length
?The
hidden_activation
you chose are allTanh()
in discriminator, actor, critic. Is it better thanReLU()
in GAIL? And will thehidden_units
affect the final performance of the agent a lot?Last question. In your
Hopper-v3
example, the finalacc_exp
andacc_pi
is shown below. But in my task, theAccuracy Exp
is approaching 1. Is there something wrong with this?Hope you can give me some advice. Thank you so much.