richardrl / rlkit-relational

Codebase for ICRA 2020 paper "Towards Practical Multi-object Manipulation using Relational Reinforcement Learning"
MIT License
97 stars 10 forks source link

Process-level parallelism? #8

Open bycn opened 4 years ago

bycn commented 4 years ago

Let's say I want to use a new algorithm, i.e., TD3 HER, and a new replay buffer (i.e., samples goals at saving instead of at training). Which methods would need to be overridden / what's the high level overview of how MPI works for training?