Batch Learning

Works reasonably well for all non-pixel tasks. For pixel tasks it's hella slow, and I don't know if that's my fault. @HansBambel should look into that, since he has the experience with this pixel stuff.

I had to change, remove, or move some things from the learners to the agent, e.g. representing the action as a one-hot vector.
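For reference, a rough sketch of what a one-hot action encoding on the agent side could look like. This is not the actual code from this change: the function names `one_hot_action` and `one_hot_batch` and the `num_actions` parameter are made up for illustration, and plain Python lists are used only to keep the snippet dependency-free.

```python
from typing import List, Sequence


def one_hot_action(action: int, num_actions: int) -> List[float]:
    """Encode a single discrete action index as a one-hot vector.

    Hypothetical helper, not taken from the repo.
    """
    if not 0 <= action < num_actions:
        raise ValueError(f"action {action} is outside [0, {num_actions})")
    vec = [0.0] * num_actions
    vec[action] = 1.0
    return vec


def one_hot_batch(actions: Sequence[int], num_actions: int) -> List[List[float]]:
    """Encode a whole batch of action indices, one row per action."""
    return [one_hot_action(a, num_actions) for a in actions]


if __name__ == "__main__":
    # A batch of three actions out of four possible ones.
    for row in one_hot_batch([0, 3, 1], num_actions=4):
        print(row)
```

Doing the encoding once in the agent means every learner gets the same action format and doesn't have to know how many actions the environment has.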