didclab / RL-Optimizer

The RL optimization work by Jamil, Elvis, and Jacob in DIDCLAB

Parallel Training #12

Open elrodrigues opened 11 months ago

elrodrigues commented 11 months ago

Extend trainer/runner/environment from #11 for parallel training.

elrodrigues commented 11 months ago

On second thought, parallel training may be achieved without #11 by wrapping our current environment in a 'pool' wrapper.

This pool would have a manager, or a cron-style job triggered by time or episode count, that periodically soft-syncs the worker models into a 'master' model to rapidly accumulate experience, assuming all jobs are normalized identically. The master model would then be redistributed to the env-threads in the pool for further (distributed) training. A rough sketch of this idea follows below.
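
The snippet below is a minimal sketch of one way the pool wrapper could look, assuming a PyTorch policy model. The names `EnvPool`, `soft_sync`, and the `tau`/`sync_every` parameters are illustrative placeholders, not part of the existing RL-Optimizer code.

```python
# Illustrative sketch of an environment pool with periodic soft-sync to a
# 'master' model. Assumes each env-thread trains its own worker copy of the
# policy between sync points.
import copy
import threading

import torch
import torch.nn as nn


def soft_sync(master: nn.Module, workers: list, tau: float = 0.5) -> None:
    """Blend the averaged worker weights into the master model (soft sync)."""
    with torch.no_grad():
        for name, param in master.state_dict().items():
            avg = torch.stack(
                [w.state_dict()[name].float() for w in workers]
            ).mean(dim=0)
            param.copy_((1.0 - tau) * param + tau * avg)


class EnvPool:
    """Wraps N worker copies of a policy, one per env-thread, and periodically
    soft-syncs them into a shared master model, which is then redistributed."""

    def __init__(self, master: nn.Module, num_workers: int, sync_every: int = 10):
        self.master = master
        self.workers = [copy.deepcopy(master) for _ in range(num_workers)]
        self.sync_every = sync_every  # sync trigger based on episode count
        self.episodes = 0
        self.lock = threading.Lock()

    def episode_finished(self) -> None:
        """Called by an env-thread at the end of each episode."""
        with self.lock:
            self.episodes += 1
            if self.episodes % self.sync_every == 0:
                soft_sync(self.master, self.workers)
                # Redistribute the updated master weights to every worker.
                for w in self.workers:
                    w.load_state_dict(self.master.state_dict())
```

A time-based trigger (e.g. a background timer thread calling `soft_sync`) could replace the episode counter; the episode-count version is shown only because it avoids extra scheduling machinery.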