Open TheButlah opened 5 years ago
Status update: the necessary parallelized and batched environments have been implemented in #25, allowing for synchronous learning methods such as A2C and Synchronous Q Learning. Additionally the model API has been formalized in #23 , enabling Agents
to be written that interact with that API. Additionally, a working environment step loop using the BatchedEnv
was implemented in #25, with comments for where to place the agent-enviroment interaction. Hence all that is left for any synchronous RL method, either this one or #6 is the actual Agent
API and implementation, as well as a suitable concrete Model
.
Lets implement that shiz yo