Open Shengyu-Feng opened 5 years ago
I think when actor is generating the next state, it fails to use the initial state from last cell, as in this line trainer.py
Agree
I think when actor is generating the next state, it fails to use the initial state from last cell, as in this line trainer.py