Finish `runner.py/BDQTrainer.train()`

didclab / RL-Optimizer

The RL optimization work by Jamil, Elvis, and Jacob in DIDCLAB

0 stars 2 forks source link

Closed elrodrigues closed 1 year ago

elrodrigues commented 1 year ago

This can be ported from the bdq tests, though it must be similar in spirit to DDPGTrainer.train()