Closed (HansBambel closed this issue 5 years ago)
So far we only train on a single obstacle map. In the long run this will result in overfitting. That's why we should generate a different map each time env.reset() is called.
Another thing that may speed up learning: reduce the map size.
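A minimal sketch of what this could look like, not the project's actual environment: the class name `RandomObstacleEnv` and the parameters `size`, `obstacle_prob` and `seed` are made up for illustration, and the map is assumed to be a simple square grid. It only shows the idea of drawing a fresh obstacle map inside `reset()` and making the map size configurable.

```python
import random


class RandomObstacleEnv:
    """Hypothetical grid environment that draws a new obstacle map on every reset()."""

    def __init__(self, size=8, obstacle_prob=0.2, seed=None):
        self.size = size                    # smaller maps may speed up learning
        self.obstacle_prob = obstacle_prob  # chance that a cell is an obstacle
        self.rng = random.Random(seed)      # private RNG so runs are reproducible
        self.grid = None
        self.agent_pos = None

    def _generate_map(self):
        # 1 marks an obstacle, 0 marks a free cell
        return [
            [1 if self.rng.random() < self.obstacle_prob else 0
             for _ in range(self.size)]
            for _ in range(self.size)
        ]

    def reset(self):
        # Draw a new map each episode instead of reusing a single fixed one
        self.grid = self._generate_map()
        # Assumption: the agent starts in the top-left cell, so keep it free
        self.grid[0][0] = 0
        self.agent_pos = (0, 0)
        # Return a copy so callers cannot modify the internal map
        return [row[:] for row in self.grid]


env = RandomObstacleEnv(size=5, obstacle_prob=0.3, seed=42)
observation = env.reset()
```

Note that a purely random map like this can produce unsolvable layouts (for example a walled-in start cell), so a real version would probably need a reachability check or a set of hand-made maps to sample from.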
I think it is better to first resolve #20 before working on this.
This is included in the current multi-task pull request.