lasso-net / lassonet

Feature selection in neural networks
MIT License
215 stars 52 forks source link

CUDA error: device-side assert triggered during stability_selection #69

Open ElrondL opened 5 days ago

ElrondL commented 5 days ago

I found that on occasion, stability selection will trigger CUDA error like below:

self.criterion(model(X_val), y_val).item() RuntimeError: CUDA error: device-side assert triggered CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1.

Could this be due to the use of validation sets? This error often occurs in the middle of stability-selection process at random number of trials (e.g. sometimes it fails at 40/100, others it fails at 80/100...).

louisabraham commented 4 days ago

I have absolutely no idea. Stability selection shouldn't have anything to do with that error. Maybe try to update pytorch to the latest version?