A flexible and efficient deep neural network (DNN) compiler that generates high-performance executables from a DNN model description.
958 stars, 162 forks
Fix memcpy in fused IR; fix kernel elimination rules; fix kernel shape check; optimize GEMM op convert #402
Closed by jlxue 2 years ago