Open Nicolas99-9 opened 6 years ago
What is the difference between these two files and which one should I use in order to train a deep gradient policy RL agent ?
I tried to run pg_model with several NN architecture but I always have a convergence of the reward around 200, have you tried it before ?
What is the difference between these two files and which one should I use in order to train a deep gradient policy RL agent ?
I tried to run pg_model with several NN architecture but I always have a convergence of the reward around 200, have you tried it before ?