Closed dusty-nv closed 6 years ago
I think it could be a problem when the states are produced by a different model. I got it working by replacing the line
next_state_values[non_final_mask] = model(non_final_next_states).max(1)[0]
with
next_state_values[non_final_mask] = model(non_final_next_states).detach().max(1)[0]
This seems to remove any references to the model that originally produced the state.
When running
reinforcement_q_learning.py
from DQN tutorial against pyTorch master, the program crashes with errors:Any known workarounds/updates?