Closed (punithsekar closed 1 month ago)
fyi @saichandax
@dvartaniansTT maxpool returns bfloat16 by design, since its output is row-major. If BFP8_B is needed, we can add a tilize op right after it.
Thanks @mywoodstock. This was filed by MCW. The intention was to use bfp8_b to squeeze out more perf.
OK, since the maxpool implementation works with row-major data, the output will be bfloat16. If tiled data is needed, we would need to implement new kernels for that format, which might be quite tricky; not sure if it's worth it.
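For readers wondering why the layout matters here: as far as I understand the format, bfloat8_b is a block floating-point type in which a group of values shares a single exponent, which is why it goes with tiled data rather than plain row-major rows. The sketch below is plain Python, not ttnn code. The block size of 16, the 7 magnitude bits, the rounding rule, and the name `quantize_block` are all assumptions made for illustration, not the hardware's exact behavior.

```python
import math

BLOCK = 16      # assumed number of values sharing one exponent
MANT_BITS = 7   # assumed magnitude bits per value (sign stored separately)


def quantize_block(values):
    """Round one block to a shared-exponent format and return decoded floats."""
    max_abs = max(abs(v) for v in values)
    if max_abs == 0.0:
        return [0.0] * len(values)
    # The shared exponent is taken from the largest magnitude in the block:
    # max_abs = m * 2**shared_exp with 0.5 <= m < 1.
    shared_exp = math.frexp(max_abs)[1]
    scale = 2.0 ** (shared_exp - MANT_BITS)   # size of one mantissa step
    limit = (1 << MANT_BITS) - 1              # largest storable magnitude
    decoded = []
    for v in values:
        q = round(v / scale)
        q = max(-limit, min(limit, q))        # clamp to the mantissa range
        decoded.append(q * scale)
    return decoded


# One large value forces a coarse step size on its 15 neighbours.
mixed = quantize_block([1.0] + [0.001] * (BLOCK - 1))
# The same small values keep their precision when the block is uniform.
uniform = quantize_block([0.001] * BLOCK)
```

In this sketch `mixed[0]` stays at 1.0 while `mixed[1]` is flushed to 0.0, whereas `uniform[0]` stays within about 1% of 0.001. The point is only that precision depends on which values end up sharing a block, so the data has to be grouped before it can be stored this way.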
@mywoodstock, the intention is to increase performance, as Dalar mentioned. I also observe a PCC drop for the whole model if we typecast to bfloat8_b after maxpool completes.
I will try to reproduce the PCC drop issue and attach it here.
I tried to reproduce the PCC drop today, but I no longer see it. We can close the ticket if bfloat8_b support cannot be added to maxpool because it uses row-major layout.
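For anyone repeating the PCC comparison, here is a minimal sketch of the check, taking PCC to mean the Pearson correlation coefficient between a reference output and a lower-precision output. It is plain Python on flat lists. The function name `pcc`, the sample data, and the 0.99 threshold are hypothetical and not taken from the model or test suite discussed in this thread.

```python
import math


def pcc(a, b):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0.0 or var_b == 0.0:
        # Correlation is undefined for constant data; report a match only
        # when the two sequences are identical.
        return 1.0 if list(a) == list(b) else 0.0
    return cov / math.sqrt(var_a * var_b)


# Hypothetical data: a reference ramp and a copy rounded to steps of 1/8,
# standing in for a lower-precision result.
reference = [0.1 * i for i in range(32)]
lossy = [round(x * 8) / 8 for x in reference]

PCC_THRESHOLD = 0.99  # hypothetical acceptance threshold
passed = pcc(reference, lossy) >= PCC_THRESHOLD
```

Running the same comparison on the tensors before and after the typecast would show whether the drop comes from the cast itself or from an earlier layer.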
Closing, as maxpool works with RM only.
Describe the bug
ttnn.max_pool2d returns an output of dtype bfloat16 even though the input passed in is bfloat8_b.

To Reproduce
Steps to reproduce the behavior: Run the following code snippet

Expected behavior
The output should have dtype bfloat8_b.

Please complete the following environment information: