Closed alphaarea closed 5 months ago
because of the nightly size issue, we had to set architectures for the nightly to align with latest ones. Building from source might be able to help this situation, closing this for now , feel free to open new ones
π Bug
I run mlc-llm on a server with Tesla P100. The last installation of mlc-llm from nightly build with cublas_gemm support was in 12/24/2023.
After entering 2024, all mlc-llm I installed lost cublas_gemm support
TVM Unity Hash Tag at the end
To Reproduce
Check the compilation parameter in the log
Expected behavior
Notice:
--opt flashinfer=0;cublas_gemm=0;cudagraph=1
Performance without cublas_gemm:
The following is the output of the old version that supports cublas
Performance with cublas_gemm:
Environment
install in 12/24/2023 with cuBLAS support
install in 1/6/2024 without cuBLAS support