Open steviet91 opened 4 years ago
The set of tracks could become progressively more difficult as they are completed. There's no point giving it an impossible task when it's failed 100 times on an easy one, but once it's learnt the simple 'go' we could look at adding features to train it a bit at a time.
Probably worth implementing it as 'each time it completes a track, a harder one is added to the track list, and for every episode, a track is selected at random from the list.' That should help stop it from forgetting how to solve the easy tracks.
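Something like this rough sketch is what I have in mind. It's not tied to the actual env code: `TrackCurriculum`, `sample_track`, `report_completion` and the track names are all made-up placeholders, and it assumes we already have the tracks ordered from easiest to hardest.

```python
import random


class TrackCurriculum:
    """Grows the pool of training tracks as the agent completes them.

    Hypothetical helper: names and structure are illustrative only.
    """

    def __init__(self, tracks_by_difficulty, seed=None):
        # Assumption: tracks are supplied ordered from easiest to hardest.
        self._all_tracks = list(tracks_by_difficulty)
        if not self._all_tracks:
            raise ValueError("need at least one track")
        self._unlocked = 1  # start with only the easiest track available
        self._rng = random.Random(seed)

    @property
    def active_tracks(self):
        """Tracks currently eligible for selection."""
        return self._all_tracks[:self._unlocked]

    def sample_track(self):
        """Pick a track uniformly at random from the active list."""
        return self._rng.choice(self.active_tracks)

    def report_completion(self):
        """Call when an episode ends with the track completed.

        Adds the next-hardest track to the active list, if any remain.
        """
        if self._unlocked < len(self._all_tracks):
            self._unlocked += 1


# Example training loop skeleton (track names are placeholders).
curriculum = TrackCurriculum(["straight", "gentle_bend", "hairpin"], seed=0)
for episode in range(5):
    track = curriculum.sample_track()
    completed = True  # placeholder for the real episode outcome
    if completed:
        curriculum.report_completion()
```

Because the easy tracks stay in the active list, they keep getting sampled after harder ones are unlocked, which is the bit that should guard against forgetting. If we'd rather only unlock a new track when the hardest active one is completed, `report_completion` could take the track as an argument and check for that.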
I'll push my rlagent branch today too, so you can hopefully merge what you've done on the OpenAI gym formatting and we're working from the same code.
@Thonners - Thoughts on the training track: