I think the solution should be that the size calculation for the output buffer should match CB size calculation.
Example
Branch: main
Compile in debug mode to enable the above assertion.
Test: pytest "tests/ttnn/unit_tests/operations/test_maxpool2d.py::test_run_max_pool[dtype=DataType.BFLOAT8_B-dilation=(1, 1)-stride=(2, 2)-padding=(0, 0)-kernel_size=(2, 2)-act_shape=[1, 256, 56, 56]-device_params={'l1_small_size': 24576}]"
For
TILE
inputs +HEIGHT_SHARDING
, I2S op does the following currently:TILE
height * number of shards, the two sizes are different, and leads to assertion with the CB allocation.I think the solution should be that the size calculation for the output buffer should match CB size calculation.
Example Branch:
main
Compile in debug mode to enable the above assertion. Test:pytest "tests/ttnn/unit_tests/operations/test_maxpool2d.py::test_run_max_pool[dtype=DataType.BFLOAT8_B-dilation=(1, 1)-stride=(2, 2)-padding=(0, 0)-kernel_size=(2, 2)-act_shape=[1, 256, 56, 56]-device_params={'l1_small_size': 24576}]"