Open vlad17 opened 6 years ago
Apply the jointly learned dynamic/encoder encoding from #26 to the encoder used in #31 and see if PPO improves
Blocked on #26 #31 Deliverables: learning curve for PPO under new state
Apply the jointly learned dynamic/encoder encoding from #26 to the encoder used in #31 and see if PPO improves
Blocked on #26 #31 Deliverables: learning curve for PPO under new state