Example of Actor critic for large number of actions

coreylynch / async-rl

Tensorflow + Keras + OpenAI Gym implementation of 1-step Q Learning from "Asynchronous Methods for Deep Reinforcement Learning"

MIT License

1.01k stars 174 forks source link

Example of Actor critic for large number of actions #2

Open gowthamnatarajan opened 8 years ago

gowthamnatarajan commented 8 years ago

Is the a3c implementation done?

Do you have an example implementation of actor critic for large action spaces. I see that there is a example for a3c, but the action space for this problem is small.