didclab / RL-Optimizer

The RL optimization work by Jamil, Elvis, and Jacob in DIDCLAB

Finish `runner.py/BDQTrainer.train()` #5

Closed: elrodrigues closed this issue 1 year ago

elrodrigues commented 1 year ago

This can be ported from the BDQ tests, though it must be similar in spirit to `DDPGTrainer.train()`.
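One way to read "similar in spirit" is that both trainers share the same outer loop (reset, act, step, store, update) and differ only in how actions are chosen and how the networks are updated. The sketch below illustrates that shape only. Every name in it (`ToyEnv`, `BaseTrainer`, `SketchBDQTrainer`, `select_action`, `update`) is hypothetical and not taken from the repository, and the random policy and no-op update are placeholders, not BDQ.

```python
import random
from abc import ABC, abstractmethod


class ToyEnv:
    """Hypothetical stand-in environment with fixed-length episodes."""

    def __init__(self, horizon=5):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return 0.0  # dummy observation

    def step(self, action):
        self.t += 1
        reward = 1.0 if action == 1 else 0.0
        done = self.t >= self.horizon
        return float(self.t), reward, done


class BaseTrainer(ABC):
    """Hypothetical shared skeleton; not the repository's actual class."""

    def __init__(self, env, max_episodes=3):
        self.env = env
        self.max_episodes = max_episodes
        self.replay = []  # plain list standing in for a replay buffer

    @abstractmethod
    def select_action(self, obs):
        """Algorithm-specific action selection."""

    @abstractmethod
    def update(self):
        """Algorithm-specific learning step."""

    def train(self):
        # Outer loop that a DDPG-style and a BDQ-style trainer could share.
        episode_returns = []
        for _ in range(self.max_episodes):
            obs, done, total = self.env.reset(), False, 0.0
            while not done:
                action = self.select_action(obs)
                next_obs, reward, done = self.env.step(action)
                self.replay.append((obs, action, reward, next_obs, done))
                self.update()
                obs = next_obs
                total += reward
            episode_returns.append(total)
        return episode_returns


class SketchBDQTrainer(BaseTrainer):
    """Placeholder subclass: random actions, update only counts calls."""

    def __init__(self, env, max_episodes=3, seed=0):
        super().__init__(env, max_episodes)
        self.rng = random.Random(seed)
        self.updates = 0

    def select_action(self, obs):
        return self.rng.choice([0, 1])  # placeholder for a Q-network policy

    def update(self):
        self.updates += 1  # placeholder for a gradient step


trainer = SketchBDQTrainer(ToyEnv(horizon=5), max_episodes=3)
returns = trainer.train()
```

With this layout, porting logic from the tests would mean filling in `select_action` and `update` while leaving `train()` itself unchanged, assuming the real `DDPGTrainer.train()` follows a comparable loop.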