Closed ganler closed 4 years ago
https://mlsys.org/Conferences/2019/doc/2019/16.pdf
Yet another hardcore work from Zhihao.
Measuring distributed execution on real hardware is slow.
2 Obs.
Execution simulator:
Operator Graph to Task Graph.
Incremental update. (Less re-profiling)
Reduce search time by 2-7x => take only a few minutes.
https://mlsys.org/Conferences/2019/doc/2019/16.pdf
Yet another hardcore work from Zhihao.