Closed Sharad24 closed 4 years ago
There's also a lot of code duplication issues that show up pylint. Maybe work on that too.
Yup, that's the goal. Another thing to do would be properly decide the parameters to be kept/removed in agents and added/removed from the trainers.
Since we already have the On Policy Agents, I'm renaming this to Off Policy. I'll raise a PR for this soon.
I'm thinking of refactoring each of the individual off policy algorithms first so that the code is neater, more uniform and shorter.
To-do:
Tracking in separate issues now. #263, #162 and #264
This should make the code much more comprehensible, especially with the number of arguments we have. And at the same time resolve a lot of maintainability issues.