Add action info of argmax qf pol

DeepX-inc / machina

Control section: Deep Reinforcement Learning framework

MIT License

279 stars 43 forks source link

Closed takerfume closed 5 years ago

rarilurelo commented 5 years ago

If some policy who want to pass different shapes in a_i, this solution will fail. But for now it is better solution.