Closed pjanevskiTT closed 2 months ago
Not assigning anyone, @abhullar-tt maybe you can help with this
The Metal issue has been assigned
Should be fixed with https://github.com/tenstorrent/tt-umd/pull/22
Describe the bug
The call read_device_memory(erisc_q_ptrs.data(), remote_transfer_ethernet_cores.at(chip_id)[i], ... indexes past the end of remote_transfer_ethernet_cores.at(chip_id).
To Reproduce
Change the [i] to .at(i) to enable range checking. Alternatively, enable ASan, which catches this type of error.
Steps to reproduce the behavior:
./build_metal.sh && ./build/test/tt_metal/unit_tests_fast_dispatch --gtest_filter="*TestWatcher*"
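For illustration, here is a minimal sketch of why swapping [i] for .at(i) exposes the problem. It is not the UMD code: `Core`, `CoresByChip`, `make_example`, and `try_get_core` are hypothetical names, and the container is assumed to be a map from chip id to a vector of cores. With two cores populated, asking for index 2 through `.at()` throws `std::out_of_range`, whereas `operator[]` would silently read past the end (undefined behavior).

```cpp
#include <cstddef>
#include <map>
#include <stdexcept>
#include <vector>

// Hypothetical stand-in for the real core type; not the UMD definition.
struct Core {
    int x;
    int y;
};

// Assumed shape of the container: chip id -> list of ethernet cores.
using CoresByChip = std::map<int, std::vector<Core>>;

// Builds an example with exactly two cores for chip 0,
// mirroring the "two cores populated" situation in this issue.
CoresByChip make_example() {
    CoresByChip cores;
    cores[0] = {Core{1, 0}, Core{2, 0}};
    return cores;
}

// Range-checked lookup: returns true and fills `out` when both the chip id
// and the index are valid, and returns false otherwise. Both .at() calls
// throw std::out_of_range on a bad key or index instead of reading
// out of bounds the way operator[] on the vector would.
bool try_get_core(const CoresByChip& cores, int chip_id, std::size_t index, Core& out) {
    try {
        out = cores.at(chip_id).at(index);
        return true;
    } catch (const std::out_of_range&) {
        return false;
    }
}
```

Calling `try_get_core(make_example(), 0, 2, core)` returns false because only indices 0 and 1 exist for chip 0. That is the same mismatch described here: two cores added, a third requested.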
Additional context
Reproduced on nebula_x1 (bgd-lab-20, single card reservation - board id 0)
With some logging I added:
When remote_transfer_ethernet_cores is populated, we add two cores:
When it is accessed, we ask for the third one:
Note: this is a copy of https://github.com/tenstorrent/tt-metal/issues/10200, filed here because Ivan couldn't create the issue himself, since he doesn't have permissions at the moment.