[Bug Report] Incorrect output When Using Mixed uint8 and bfloat16 Formats in Compute Kernel

tenstorrent / tt-metal

:metal: TT-NN operator library, and TT-Metalium low level kernel programming model.

Apache License 2.0

430 stars 58 forks source link

Describe the bug We discovered a new bug while investigating issue #11962. I found that the output results are incorrect when mixing bfloat16 and uint8 in the compute kernel while using them in the CB and DST registers.

To Reproduce branch : ilkoo/uint8_dst_reg

compute kernel : https://github.com/tenstorrent/tt-metal/blob/ilkoo/uint8_dst_reg/ttnn/cpp/ttnn/deprecated/tt_dnn/op_library/moreh_test2/kernels/moreh_test2.cpp

To reproduce the error, run the following command from the ilkoo/uint8_dst_reg branch: pytest tests/tt_eager/python_api_testing/unit_testing/misc/test_moreh_test2.py

This branch includes the changes from rd/stall_unpack_reconfig.

It does not occur when mixing float32 and uint8.

Expected behavior The expected output (PASSED) should be produced.

tenstorrent / tt-metal

[Bug Report] Incorrect output When Using Mixed uint8 and bfloat16 Formats in Compute Kernel #12963