vub-ai-lab / bdpi

Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration
GNU General Public License v3.0
25 stars, 5 forks