alizandian / safe-split-dqn

Safer Deep Reinforcement Learning (DRL) by iteratively updating domain variables including Safety Monitors. Currently extending work from here: https://github.com/jinwooro/Safety-RL-Generic
3 stars 0 forks source link