-
at `a3c_agent.step`,
action is chosen by `act_id = valid_actions[np.argmax(non_spatial_action[valid_actions])]`
However I think they should be chosen randomly by their probability because `non_spa…
-
I am working on reinforcement learning task and it requires calculating prediction too many times. I have found that 56,87% of cumulative time is taken by **_predict_loop** method. Also I have found t…
-
## Configuration:
- Untuned: 10 repeats, and median stability score is taken. Default parameters.
- Tuning: On Spark, 300 evaluations
## Results:
- Delta, Positive, tuning is better.
- Different Combi…
-
Apologies in advance if this has already been discussed else where and please do link me to any such prior discussion.
I've read:
https://github.com/cosmos/cosmos/blob/master/PURPOSE.md
https:/…