Failure to train model on cpu

AI4HealthUOL / SSSD-ECG

Repository for the paper: 'Diffusion-based Conditional ECG Generation with Structured State Space Models'

MIT License

42 stars 6 forks source link

Failure to train model on cpu #8

Closed bigQY closed 1 year ago

bigQY commented 1 year ago

I adjusted the running environment of the code from cuda to cpu and found that the model doesn't run properly

The grad of the calculated loss was none, and after optimiser.step(), the parameters of the model changed to nan, and the training could not be continued.

Can you help me with this or May I ask what kind of environment you have over there, including python and pytorch versions, hardware informations, etc.?

juanlopezcode commented 1 year ago

hey, pytorch 1.13.0, python 3.10.8, gpu a30, cuda version 11.6

bigQY commented 1 year ago

I have run the model successfully on cuda, thanks for your help!