Open nigelzzzzzzz opened 4 days ago
Hi @nigelzzzzzzz, I don't think I can explain it better than the code but effectively a model is equivalent to a program -- just like how compilers in the past converted programs to binary, they some times go through intermediary representations. These intermediary representations are expressed in MLIR. StableHLO is a dialect of MLIR, and so is VHLO. During this process to convert a model, there are sometimes opportunities to optimize the model/program/graph. It would take a long time to explain this all and I do not know every piece myself -- you might want to dig into the code yourself to see how you can make sense of it.
Description of the bug:
hi @pkgoogle, i have some question about computer graph with tinyllama.
tok embedding
.stable-hlo composite op
, can i know how to work in this? i know the op fuse is a optimization method. but why choose below op to fuse.stable-hlo composite op
is22
.graph input
.Actual vs expected behavior:
No response
Any other information you'd like to share?
No response