In older versions of Keras, dense layers are implemented with tensordot. For tensors with rank > 2 it does a reshape and uses MatMul, i.e. [a,b,c] -> reshape to [a*b,c] -> MatMul -> [a*b,d] -> reshape to [a,b,d] -> bias -> activation. We use BMM (BatchMatMul) instead so that we do not need the reshape ops, and it is also easier to handle other post-ops in the future. I think the current Keras 3.0 also uses BMM.
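For illustration, here is a small sketch (shapes are made up, not taken from any real model) of the two lowerings described above: the tensordot / reshape + MatMul path used by older Keras dense layers, and the direct matmul path that dispatches to BatchMatMulV2 for rank-3 inputs:

```python
import tensorflow as tf

# Illustrative shapes: a = batch, b = sequence length, c = in features, d = out features.
a, b, c, d = 2, 16, 64, 128
x = tf.random.normal([a, b, c])
w = tf.random.normal([c, d])

# Older Keras dense path: tensordot, which internally performs
# [a,b,c] -> reshape [a*b,c] -> MatMul -> [a*b,d] -> reshape [a,b,d].
y_tensordot = tf.tensordot(x, w, axes=[[2], [0]])

# The same computation written out explicitly with reshapes around a plain MatMul.
y_reshape = tf.reshape(tf.matmul(tf.reshape(x, [a * b, c]), w), [a, b, d])

# BMM path: with a rank-3 left operand, tf.matmul dispatches to BatchMatMulV2
# (the weight's empty batch dimensions broadcast), so no reshape ops are needed.
y_bmm = tf.matmul(x, w)

tf.debugging.assert_near(y_tensordot, y_reshape, atol=1e-4)
tf.debugging.assert_near(y_reshape, y_bmm, atol=1e-4)
```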
I am running the Hugging Face OPT 350M model with ITEX 2.14.
In the intermediate (FFN) block, when the Relu/Gelu fusion is triggered, I noticed that the resulting op type is FusedBatchMatMul instead of FusedMatMul (with Gelu Approximate/Exact/Relu as the post-op).
Input Graph:
Resultant Graph:
For reference, here is where this graph-level fusion takes place: https://github.com/intel/intel-extension-for-tensorflow/blob/d8fe3daa49f81767c1dd783325c330a145d945bd/itex/core/graph/remapper/remapper.cc#L912
Is there a specific reason why FusedBatchMatMul is chosen over FusedMatMul? A minimal sketch of the pattern I am describing is below.
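The following is only an illustrative stand-in for the OPT intermediate block, not the actual model code: the shapes, names, and use of tf.nn.gelu are assumptions for the sketch, and whether ITEX fuses this exact toy graph depends on the build and configuration. It is meant to show why the remapper sees BatchMatMul rather than MatMul in this block:

```python
import tensorflow as tf

# Hypothetical stand-in for the OPT intermediate (FFN) block:
# matmul + bias + GELU on a rank-3 activation [batch, seq_len, hidden].
hidden, intermediate = 1024, 4096

@tf.function
def intermediate_block(x, kernel, bias):
    # With a rank-3 input, tf.matmul lowers to BatchMatMul(V2), so the pattern
    # reaching the remapper is BatchMatMul -> BiasAdd -> Gelu,
    # not MatMul -> BiasAdd -> Gelu.
    y = tf.nn.bias_add(tf.matmul(x, kernel), bias)
    return tf.nn.gelu(y, approximate=True)

x = tf.random.normal([8, 128, hidden])
kernel = tf.random.normal([hidden, intermediate])
bias = tf.zeros([intermediate])
out = intermediate_block(x, kernel, bias)
print(out.shape)  # (8, 128, 4096)
```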