Distributed checkpoint loading failed because workers disagreed on the loading method. Fix that, make checkpoint loading a bit less verbose, and add some safety barriers.
🔍 Type of change
Select all that apply:
[x] 🐛 Bug fix (non-breaking change that addresses a specific issue)
[ ] 🚀 New feature (non-breaking change that adds functionality)
[ ] ⚠️ Breaking change (a change that could affect existing functionality)
✨ Description
Distributed checkpoint loading failed because workers disagreed on the loading method. Fix that, make checkpoint loading a bit less verbose, and add some safety barriers.
🔍 Type of change
Select all that apply: