rayandrew closed 3 weeks ago
There is one bug: if total_training_steps
is not specified, it defaults to -1.
I added a check for that as well.
I think the last commit fixed the CI @hariharan-devarajan
@zhenghh04 This is ready for merge as well.
If we have 730 steps, the DLIO benchmark only executes up to step 729.
The bug also persists when the user specifies
total_training_steps.
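To illustrate the off-by-one described above, here is a minimal sketch. It is not the actual DLIO code: the function name executed_steps and the parameters available_steps and off_by_one are hypothetical, and the only details taken from this thread are the -1 default for total_training_steps and the 730 vs. 729 step counts.

```python
def executed_steps(available_steps, total_training_steps=-1, off_by_one=False):
    """Return the 1-based step numbers a training loop would run.

    Hypothetical illustration only; names and structure are assumptions,
    not the DLIO benchmark implementation.
    """
    if total_training_steps == -1:
        # -1 means "not specified": run every available step.
        limit = available_steps
    else:
        # A user-specified cap cannot exceed what the dataset provides.
        limit = min(total_training_steps, available_steps)

    # The reported bug behaves as if the final step were dropped.
    last = limit - 1 if off_by_one else limit
    return list(range(1, last + 1))


if __name__ == "__main__":
    print(len(executed_steps(730, off_by_one=True)))   # buggy behaviour
    print(len(executed_steps(730)))                    # expected behaviour
    print(len(executed_steps(730, total_training_steps=100)))
```

The sketch handles the -1 sentinel before computing the loop bound, so the same inclusive bound applies whether or not the user sets total_training_steps.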
Fix: #235