A2C Implementation

Status update: the necessary parallelized and batched environments have been implemented in #25, allowing for synchronous learning methods such as A2C and Synchronous Q Learning. The model API has also been formalized in #23, enabling Agents to be written against that API. A working environment step loop using the BatchedEnv was also implemented in #25, with comments marking where to place the agent-environment interaction. Hence all that is left for any synchronous RL method, whether this one or #6, is the actual Agent API and implementation, as well as a suitable concrete Model.
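For anyone picking this up, here is a rough sketch of how an agent could slot into a batched step loop. It is not the code from #25 or the API from #23: `ToyBatchedEnv`, `RandomAgent`, `act`, `observe`, and `run_step_loop` are all hypothetical names made up for illustration, and the real BatchedEnv and Agent interfaces may look quite different.

```python
import random
from typing import List, Tuple


class ToyBatchedEnv:
    """Hypothetical stand-in for a batched env: N copies stepped in lockstep."""

    def __init__(self, num_envs: int, horizon: int = 5) -> None:
        self.num_envs = num_envs
        self.horizon = horizon
        self._t = [0] * num_envs  # per-copy timestep, doubles as the observation

    def reset(self) -> List[int]:
        self._t = [0] * self.num_envs
        return list(self._t)

    def step(self, actions: List[int]) -> Tuple[List[int], List[float], List[bool]]:
        assert len(actions) == self.num_envs
        rewards = [float(a) for a in actions]  # action 1 pays 1.0, action 0 pays 0.0
        self._t = [t + 1 for t in self._t]
        dones = [t >= self.horizon for t in self._t]
        # Finished copies are reset immediately so the batch stays synchronous.
        self._t = [0 if d else t for t, d in zip(self._t, dones)]
        return list(self._t), rewards, dones


class RandomAgent:
    """Hypothetical agent interface: act on a batch, then observe the transition."""

    def __init__(self, seed: int = 0) -> None:
        self._rng = random.Random(seed)

    def act(self, observations: List[int]) -> List[int]:
        return [self._rng.randint(0, 1) for _ in observations]

    def observe(self, obs, actions, rewards, next_obs, dones) -> None:
        # A synchronous method such as A2C would store the transition here
        # and update its model; this placeholder does nothing.
        pass


def run_step_loop(env: ToyBatchedEnv, agent: RandomAgent, num_steps: int) -> float:
    obs = env.reset()
    total_reward = 0.0
    for _ in range(num_steps):
        actions = agent.act(obs)                      # agent -> environment
        next_obs, rewards, dones = env.step(actions)  # one synchronous batched step
        agent.observe(obs, actions, rewards, next_obs, dones)  # environment -> agent
        total_reward += sum(rewards)
        obs = next_obs
    return total_reward
```

Splitting the agent into `act` and `observe` is only one possible design. It keeps the loop itself agnostic to whether the agent learns on every step or after a fixed rollout length.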

TheButlah / makrl

A2C Implementation #4