yngtdd / hyperspace

Distributed Bayesian Optimization
23 stars 8 forks source link

Reload #8

Closed yngtodd closed 6 years ago

yngtodd commented 6 years ago

Adds fault tolerant checkpointing! Now we can resume from previous distributed runs, even if some of the ranks failed. 🔥