Open danisch-khurshid-creator opened 4 years ago
Try this... it should work...
target_actions = np.array([[1 if a==i else 0 for i in range(2)] for a in actions]) loss = network.train_on_batch(states,target_actions, sample_weight=discounted_rewards)
Hi. The code Code is not working with this line:
loss = network.train_on_batch(states, discounted_rewards)
.