Open zyzzyxdonta opened 2 years ago
Interesting. Thank you for reporting this. The let's go for 120min.
If you have other time limits in mind, please let me know.
I tried 90 and that wasn't enough. I'll try 120 tomorrow.
snakemake --report report.html
is quite interesting (even if this picture isn't too precise):For resnet and resnext, the two GPU types can clearly be distinguished 😄
With the hemera config, (some?) jobs for the rules
imagenette2_resnet50_default
andimagenette2_resnext50_default
run into the time limit of 75 minutes. I think this only happens when they are scheduled on the nodes with P100 cards. The jobs on V100 cards seem to run fine with just over 60 minutes runtime.So either, the config should set a longer runtime for these rules, or the partition should be set to gpu_v100. But since not a lot of V100 cards are freely available on hemera, I think it would be best to go with the first option. I don't know by how much the runtime should be increased, though.