deepmodeling / dpdispatcher

generate HPC scheduler systems jobs input scripts and submit these scripts to HPC systems and poke until they finish
https://docs.deepmodeling.com/projects/dpdispatcher/
GNU Lesser General Public License v3.0
45 stars 57 forks source link

{question] anyway to set the number of resubmission fail jobs? #502

Closed thangckt closed 1 month ago

thangckt commented 1 month ago

Dear Devs,

I wonder there is anyway to set the number of resubmission on fail jobs? (or changing the default value}

Thanks

njzjz commented 1 month ago

For example,

machine.retry_count = 10
thangckt commented 1 month ago

hi @njzjz Can you show me where to set machine.retry_count?

I try in machine dict with

machine: {retry_count: 0} 

But is does not take effct.