Closed qiaoting159753 closed 8 months ago
Add an extra argument to TrainingConfig for training the world model. The world model can be trained several times per time step, so it needs its own G, separate from the existing one.
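A minimal sketch of what the request could look like, not the repository's actual code. It assumes `G` is the number of updates performed per time step. The field name `G_model`, the dataclass form of `TrainingConfig`, and the helper `updates_for_step` are hypothetical and only illustrate the idea of giving the world model its own update count.

```python
from dataclasses import dataclass


@dataclass
class TrainingConfig:
    # Assumed meaning of the existing setting: updates per time step.
    G: int = 1
    # Hypothetical new setting: world-model updates per time step.
    G_model: int = 1


def updates_for_step(config: TrainingConfig) -> list[str]:
    """Return the ordered list of updates run for a single time step."""
    # The world model is updated G_model times, independently of G.
    schedule = ["world_model"] * config.G_model
    # The remaining updates still follow the existing G.
    schedule += ["policy"] * config.G
    return schedule


# Example: train the world model five times per step, the rest once.
config = TrainingConfig(G=1, G_model=5)
print(updates_for_step(config))
```

Defaulting the new field to 1 would keep existing configurations behaving as before unless the argument is set explicitly.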