stanford-futuredata / gavel

Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020
MIT License
125 stars 31 forks source link

Add heartbeats to allow scheduler to remotely kill failed jobs #188

Open santhnm2 opened 4 years ago