DeepX-inc / machina

Control section: Deep Reinforcement Learning framework
MIT License
279 stars 45 forks source link

Add action info of argmax qf pol #163

Closed takerfume closed 5 years ago

rarilurelo commented 5 years ago

If some policy who want to pass different shapes in a_i, this solution will fail. But for now it is better solution.