broadinstitute / ml4h


Better allocation of workers for train and validation sets #260

Open ndiamant opened 4 years ago

ndiamant commented 4 years ago

What: Allocate workers based on the number of train vs. validation steps, so that the validation workers don't pull way ahead of the train workers.

Why: Better use of CPUs for faster training.

How: num train workers = total workers * training steps / (training steps + validation steps), or something similar, with the remaining workers going to validation.
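A minimal sketch of the proportional split proposed above, not the ml4h implementation. The function name `split_workers` and its parameters are hypothetical. It assumes a fixed pool of at least two workers and that each set must keep at least one worker.

```python
from typing import Tuple


def split_workers(total_workers: int, train_steps: int, valid_steps: int) -> Tuple[int, int]:
    """Split a worker pool between train and validation in proportion to their step counts.

    Hypothetical helper for illustration; names are not taken from ml4h.
    """
    if total_workers < 2:
        raise ValueError("need at least two workers to serve both sets")
    if train_steps <= 0 or valid_steps <= 0:
        raise ValueError("step counts must be positive")

    # Share of the pool proportional to the share of training steps.
    train_fraction = train_steps / (train_steps + valid_steps)
    train_workers = round(total_workers * train_fraction)

    # Clamp so that neither set is left without a worker.
    train_workers = max(1, min(total_workers - 1, train_workers))
    return train_workers, total_workers - train_workers


if __name__ == "__main__":
    print(split_workers(8, 300, 100))  # (6, 2)
```

Clamping matters when one set has very few steps: without it, a heavily skewed ratio could round the validation share down to zero workers.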

Acceptance Criteria: Workers are allocated based on the number of train vs. validation steps. A speed comparison would be good, ideally through a test!

StevenSong commented 4 years ago

Hm, to be honest, I think my fix in #327 made it worse. Will reopen.