thushv89 / AdaCNN

AdaCNN algorithm. Clean implementation
0 stars 0 forks source link

get mean activation instead of max activation and plot q values side by side #6

Closed thushv89 closed 7 years ago

thushv89 commented 7 years ago

Also have the weighted reward

thushv89 commented 7 years ago

plot_weighted_reward_vs_non_weighted