Closed pellegreene closed 5 months ago
Notes on two of the models: nasnet_large_pytorch -600G RAM Used slurmstepd: error: Detected 1 oom-kill event(s) in StepId=36556096.batch. Some of your processes may have been killed by the cgroup out-of-memory handler.
inception_v4_pytorch File "<__array_function__ internals>", line 6, in concatenate numpy.core._exceptions.MemoryError: Unable to allocate 340. GiB for an array with shape (1000, 91216808) and data type float32
Have files for:
Will be validating these and hopefully queueing them up on OM today for a first pass through on MajajHong2015.IT-pls.
Verify the scores on the remaining Top 25 models: