[SingleContinuous, Continuous] TD3 agents

martyn-smith / Eastmann-Adversarial

Implementations of the Tennessee Eastman process suitable for Adversarial Reinforcement Learning


Open martyn-smith opened 2 years ago

martyn-smith commented 2 years ago

As per the title, implement Twin Delayed DDPG (TD3) agents for the SingleContinuous and Continuous branches.
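For reference, a minimal sketch of the three pieces that distinguish TD3 from plain DDPG: target policy smoothing, the clipped double-Q target, and delayed actor updates. It is pure Python and framework-agnostic. All names here (`td3_target`, `should_update_actor`, the `target_actor` / `target_q1` / `target_q2` callables) and the default hyperparameters are assumptions for illustration, not code from this repository.

```python
import random


def clip(x, lo, hi):
    """Clamp a scalar to the closed interval [lo, hi]."""
    return max(lo, min(hi, x))


def td3_target(reward, next_state, done, target_actor, target_q1, target_q2,
               gamma=0.99, policy_noise=0.2, noise_clip=0.5,
               action_low=-1.0, action_high=1.0, rng=random):
    """Compute the TD3 critic target for one transition.

    target_actor(state) -> list of floats (hypothetical target policy)
    target_q1/target_q2(state, action) -> float (hypothetical target critics)
    """
    # Target policy smoothing: add clipped Gaussian noise to the target
    # action, then clamp the result back into the valid action range.
    next_action = []
    for a in target_actor(next_state):
        noise = clip(rng.gauss(0.0, policy_noise), -noise_clip, noise_clip)
        next_action.append(clip(a + noise, action_low, action_high))

    # Clipped double-Q: take the smaller of the two target critic estimates
    # to counteract overestimation bias.
    q_next = min(target_q1(next_state, next_action),
                 target_q2(next_state, next_action))

    # Standard one-step bootstrap; no bootstrapping on terminal transitions.
    return reward + gamma * (1.0 - float(done)) * q_next


def should_update_actor(step, policy_delay=2):
    """Delayed updates: refresh the actor (and target networks) only every
    `policy_delay` critic updates."""
    return step % policy_delay == 0
```

Both critics would be regressed towards the same `td3_target` value, and the actor and target networks would only be updated on steps where `should_update_actor` returns true. The action bounds above are placeholders; the real limits would come from the environment's action space.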