Open ghost opened 3 years ago
Thanks for the report @WarmHouse! I will look into it ASAP! (I'm a bot).
Hi, NNFusion leverages CUDA lib kernels (e.g., cuBLAS, cuDNN) by default. In the artifact, some kernels (e.g., Dot, Convolution) are tuned in TVM or implemented manually and are converted to rOperator kernels and injected into the kernel DB for better performance. Here is the example in the tutorial. And here is the document of building kernel DB for the artifact models from pre-tuned kernels.
🐛 Bug
I try to reproduce the LSTM model according to the tutorial. https://github.com/microsoft/nnfusion/blob/osdi20_artifact/artifacts/get_started_tutorial/README_GET_STARTED.md
It seems that the final does not use the MatMul operator generated by tvm.
The environment configuration