Open abhullar-tt opened 2 weeks ago
Could you walk me through the sequence being used to load fw on the eth cores?
Could you walk me through the sequence being used to load fw on the eth cores?
This sequence has been working for the p100s and nothing has changed here with the new FW
Adding @TTDRosen for visibility
This issue was expected to be hit on ethernet cores with active links on P150s but is not expected on:
Looks like we are running into this in the unexpected cases because FW on these link-less cores is running an init sequence. Proposed solution from @bingliTT is for FW on these cores to skip straight to a heartbeat counter which shouldn't have any issues if Metal is using eth risc0
FYI @ttmchiou
New eth FW and FW for tt-smi reset has broken our ability to target / load fw on eth cores. This is seen in p100s which do not have any active eth cores.
This problem is showing up on p100s and p150s
CI machines are running this new FW and currently no tests can be run.
Workaround in main: Push a patch to make it look like BH has no eth cores