Open dhensle opened 7 months ago
Completed on a 1TB machine with 48 cores. Used num_processes: 40
and no chunking. Completed in about 3 hours.
logs_no_sh_full_mp.zip (Run failed in final summarize step because the mp config was using a step not relevant for the ABM3 example, but doesn't make much impact on the runtime estimate.)
Ran multiprocessing with sharrow off and varied the number of cores (just like https://github.com/ActivitySim/sandag-abm3-example/issues/22#issuecomment-2223546038). Completed on a 500 GB, 24 core RSG machine.
Observations:
Performed multiprocessing tests with different numbers of processors on WSP server that has 512 GB RAM and 32 cores, with sharrow off, explicit chunking set to 0.2 (5 chunks), and using the zlib skims.
The observations are similar to David's. Many components see diminishing returns with more processors. The oddball is non-mandatory tour scheduling which only gets worse with more processors.
This is the issue to report on memory usage and runtime performance...
data_dir: "data-full"
full scale skims (24333 MAZs)households_sample_size: 0
(full scale 100% sample of households)sharrow: false
multiprocess: True