Closed gbossu closed 3 weeks ago
note: we are lagging behind upstream by a couple of months, so i cherry-picked some commits from there to minimise conflicts.
This PR extends MachineLICM in a very clever way. I left some minor comments, mostly for clarification.
This mostly extends the existing post-RA LICM pass so that it actually does something about instructions with register inputs. I'll see if I can upstream those changes.
Then there is a DAGMutator change to give more opportunities to
MachineLICM
Better review commit by commit.
I'll check the 30% regression in ReLu_bfloat16 in more detail (it comes from extra spills). But even in this state the QoR is good.