Open gunjitsingh1985 opened 2 years ago
Hello, first off, if you use runx, please make sure that LOGROOT is defined within your .runx file with an absolute path and that path exists :).
I'm not aware of people running this code on mac. The TPCStore address in use error is something that you tend to get if you try to run once, then ctrl-c, then run again and you didn't clean up the old run.
I also have no idea whether this code can run on a mac, so you're on your own there.
Hi All, I've been experiencing some issues getting a basic pre-trained model to run some inference. I have taken the following steps but to no Avail
The remediation I took for this was to ditch using runx altogether and use the default params specified in the 'dump_folder.yml' file directly from the command line, with the parameters in HPARAMS passed in directly
That generated the following "RuntimeError:Address already in use" error. Which persisted even when I took the "nproc_per_node" parameter down from 8 to 1
Screenshots below -a)
b)
c)