faris-k / fastsiam-wafers

Self-Supervised Representation Learning of Wafer Maps with FastSiam
MIT License
6 stars 0 forks source link

Segmentation fault, CUDNN_STATUS_EXECUTION_FAILED, and launch time out #10

Closed faris-k closed 1 year ago

faris-k commented 1 year ago

Consistent crashes occur with any nonzero value of num_workers on Windows systems. Multiprocessing on Linux doesn't seem to have this issue, but the added abstraction of WSL2 leads to low GPU utilization and thus lower throughput.

After painful trial and error, here are a few observations.

Possible course of action:

faris-k commented 1 year ago

Not a total resolution, but some notes: