I switched the code from running on CUDA to running on the CPU and found that the model no longer trains properly.
The gradient of the computed loss was None, and after optimiser.step() the model parameters became NaN, so training could not continue.
Could you help me with this? Alternatively, could you share the environment you are using, including the Python and PyTorch versions and the hardware?