Closed aditew01 closed 6 days ago
This enables jit:uni reorder for dtype-> bf16 -> f32
jit:uni
bf16 -> f32
make test
make test_benchdnn_*
Adding a small testcase logs to demonstrate perf numbers before and after.
TC: ONEDNN_VERBOSE=all ./benchdnn --reorder --sdt=bf16 --ddt=f32 --mode=p 1x4096x4096
ONEDNN_VERBOSE=all ./benchdnn --reorder --sdt=bf16 --ddt=f32 --mode=p 1x4096x4096
(new) jit:any :
onednn_verbose,v1,primitive,exec,cpu,reorder,jit:uni,undef,src:bf16::blocked:abc::f0 dst:f32::blocked:abc::f0,,,1x4096x4096,0.493896
(old) simple:any :
onednn_verbose,v1,primitive,exec,cpu,reorder,simple:any,undef,src:bf16::blocked:abc::f0 dst:f32::blocked:abc::f0,,,1x4096x4096,11.9778
Thanks for the PR. Please rename the commit message to adhere to the oneDNN coding guideline.
cc: @theComputeKid
cc: @Sqvid
cc: @mgouicem
Description
This enables
jit:uni
reorder for dtype->bf16 -> f32
Checklist
General
make test
andmake test_benchdnn_*
) pass locally for each commit? - YesPerformance improvements
Adding a small testcase logs to demonstrate perf numbers before and after.
TC:
ONEDNN_VERBOSE=all ./benchdnn --reorder --sdt=bf16 --ddt=f32 --mode=p 1x4096x4096
(new) jit:any :
(old) simple:any :