ferenctorok closed 4 years ago
Normalizing every observation and action to the range (-1, 1). Trained the agent on SAC expert trajectories. The trained agent checkpoints are also pushed for further use.
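The normalization step could look roughly like the sketch below. This is not the code from this change: it assumes each observation and action dimension has known finite bounds `low` and `high`, and the helper names `normalize` and `denormalize` are hypothetical. With a linear map the bounds themselves land exactly on -1 and 1.

```python
import numpy as np


def normalize(x, low, high):
    """Linearly map x from [low, high] to [-1, 1], element-wise.

    Assumption: low < high for every dimension (finite, known bounds).
    """
    x = np.asarray(x, dtype=np.float64)
    low = np.asarray(low, dtype=np.float64)
    high = np.asarray(high, dtype=np.float64)
    return 2.0 * (x - low) / (high - low) - 1.0


def denormalize(x_norm, low, high):
    """Inverse of normalize: map values from [-1, 1] back to [low, high]."""
    x_norm = np.asarray(x_norm, dtype=np.float64)
    low = np.asarray(low, dtype=np.float64)
    high = np.asarray(high, dtype=np.float64)
    return low + 0.5 * (x_norm + 1.0) * (high - low)


if __name__ == "__main__":
    # Hypothetical 2-D observation with per-dimension bounds.
    obs_low = np.array([0.0, -5.0])
    obs_high = np.array([10.0, 5.0])
    obs = np.array([2.5, 0.0])

    obs_norm = normalize(obs, obs_low, obs_high)
    print(obs_norm)
    print(denormalize(obs_norm, obs_low, obs_high))
```

Applying `denormalize` to the policy output before passing it to the environment keeps the agent working entirely in the normalized space, which is also the space the expert trajectories would need to be stored in.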