A flexible and efficient deep neural network (DNN) compiler that generates high-performance executables from a DNN model description.
937 stars, 157 forks
convert int64 to long long for built-in cuda kernels #498
Closed
mzmssg closed 1 year ago