Open TheButlah opened 6 years ago
OK, I think this is completed, although I can't test it until we get the agent and trainer implemented. I'll mark as closed for now.
Reopening due to the fact that we now need this to be n-step and conform with the API
Implement a subclass of ActionModel which is a fully connected network to approximate q function. This will be used as a simple example for any TD methods.