Open tt-rkim opened 2 days ago
Does retrying tt-smi
actually work?
If retries work, maybe we can reproduce it by repeating the cpp tests in a custom dispatch, maybe also try to run it on the same machine too If this fixes the issue, doesn't this mean its a issue caused by the tests?
Ticket
15243
Problem description
CPP tests are unstable and fail ND often on main
What's changed
Retry to improve stability. Need to be able to reproduce so we can raise to runtime team
Checklist