Casting transpose to bfloat16 still gives float32 on output

tenstorrent / tt-forge-fe

The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their performance and efficiency.

Apache License 2.0

20 stars 3 forks source link

Repro:

checkout: dgolubovic/repro-tensor-mismatch-due-to-bfloat16

Run: pytest -svv forge/test/mlir/test_ops.py::test_transpose[params0]

ERROR | forge.op.eval.common:compare_with_golden_pcc:245 - Tensor mismatch

What we get is tensor missmatch between framework output and compiled model. What is strange that initial graph of compiler has output df float32 even though we casted it to bfloat16:

TODO: Investigate the cause of missmatch and why output dataformat is float32...

This is not currently blocker to anything but it may increase priority if we want to transfer all models to compile at bfloat16 dataformat...

tenstorrent / tt-forge-fe

Casting transpose to bfloat16 still gives float32 on output #690