Move fused loop to just before the last loop being fused, to avoid any ssa dominance issue;
More informative failure message when broken dependence between air.channel.get---as data producer---and air.channel.put---as data consumer---is detected in the generated fused scf.for loop.
air.wait_all
;air.channel.get
---as data producer---andair.channel.put
---as data consumer---is detected in the generated fusedscf.for
loop.