Closed by limcheekin 3 weeks ago
Investigating, thanks for the report! A possibly related issue was found in https://github.com/mlc-ai/relax/issues/325 and downgrading LLVM to <= 18 seemed to work. The specific error strings were different from the ones you mentioned, but as a short-term workaround, could you try the build process with LLVM <= 18?
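For anyone trying this workaround, it can help to confirm which LLVM version the build will pick up before re-running cmake. The following is a minimal sketch, assuming `llvm-config` is on the PATH; the helper names `llvm_major` and `check_llvm` are made up for illustration and are not part of TVM or MLC LLM.

```python
import re
import shutil
import subprocess


def llvm_major(version_text):
    """Return the leading major version from text such as '18.1.8', or None."""
    match = re.match(r"\s*(\d+)", version_text)
    return int(match.group(1)) if match else None


def check_llvm(max_major=18, tool="llvm-config"):
    """Return True/False for 'LLVM major <= max_major', or None if tool is missing.

    Assumption: `tool` is the llvm-config binary that the TVM build will use.
    """
    path = shutil.which(tool)
    if path is None:
        return None  # llvm-config is not installed or not on PATH
    result = subprocess.run(
        [path, "--version"], capture_output=True, text=True, check=True
    )
    major = llvm_major(result.stdout)
    return major is not None and major <= max_major
```

Calling `check_llvm()` before the build gives a quick yes/no answer; if several LLVM versions are installed, pass the specific binary name as `tool`.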
@nihalgeorge01 Thanks for the quick response, it works! :)
Even though there were some errors on my console during the build with LLVM <= 18, I managed to run mlc_llm chat HF://mlc-ai/Llama-3.2-1B-Instruct-q4f16_0-MLC after the build and chat with the model on my console.
I have a newbie question: how do I know which models hosted at https://huggingface.co/mlc-ai are compatible with and runnable on the Orange Pi 5 Max? I tried HF://mlc-ai/Llama-3-8B-Instruct-fp8-MLC; it seems to load successfully, but the response text is just many exclamation marks (!).
Given the following models:
I understand that q4f32 will generate better responses than q4f16, and that q4 is better than q0. Is my understanding correct? What do the _0 in q4f16_0 and the _1 in q4f32_1 mean for the models above?
Please advise. Thank you.
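For readers with the same question, here is a minimal sketch that pulls the quantization code out of a model name so variants can be compared side by side. It assumes the naming convention described in the MLC LLM quantization documentation, qAfB_id, where A is the weight bit width, B is the activation float width, and the trailing id distinguishes quantization schemes. The function name `parse_quant` is hypothetical, and the sketch makes no claim about output quality.

```python
import re

# Assumed convention: q<weight bits>f<activation float bits>[_<scheme id>]
QUANT_RE = re.compile(r"q(\d+)f(\d+)(?:_(\d+))?")


def parse_quant(model_name):
    """Split a quantization code such as 'q4f16_0' out of a model name.

    Returns a dict of integer fields, or None when the name does not
    contain a code in the assumed q<A>f<B>[_<id>] form.
    """
    match = QUANT_RE.search(model_name)
    if match is None:
        return None
    weight_bits, activation_bits, scheme_id = match.groups()
    return {
        "weight_bits": int(weight_bits),
        "activation_bits": int(activation_bits),
        "scheme_id": int(scheme_id) if scheme_id is not None else None,
    }
```

With this sketch, `parse_quant("Llama-3.2-1B-Instruct-q4f16_0-MLC")` yields 4 weight bits, 16 activation bits, and scheme id 0, while a name such as `Llama-3-8B-Instruct-fp8-MLC` returns None because it does not follow the assumed pattern.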
🐛 Bug
I tried to build the TVM Unity compiler from source by following the instructions at https://llm.mlc.ai/docs/install/tvm.html#option-2-build-from-source.
There is no error when I build only the TVM runtime with the command
cmake .. && cmake --build . --target runtime --parallel $(nproc)
But there are some errors when I run the full build with the command
cmake .. && cmake --build . --parallel $(nproc)
The following is the build log:
The build log is long and hard to copy from the console screen. Let me know if you need more information or the complete build log; I will re-run the build and save the output to a log file.
Thanks in advance.
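One way to capture a long build log without copying it from the console is to stream the build output to both the screen and a file. The following is a minimal sketch using only the Python standard library; the function name `run_and_log` and the file name `build.log` are illustrative choices, not part of the TVM build tooling.

```python
import subprocess
import sys


def run_and_log(cmd, log_path):
    """Run cmd, echo its combined stdout/stderr, and save a copy to log_path.

    Returns the exit code of the command.
    """
    with open(log_path, "w", encoding="utf-8") as log:
        proc = subprocess.Popen(
            cmd,
            stdout=subprocess.PIPE,
            stderr=subprocess.STDOUT,  # merge stderr so errors stay in order
            text=True,
        )
        for line in proc.stdout:
            sys.stdout.write(line)  # keep live progress on the console
            log.write(line)         # and keep a full copy on disk
        return proc.wait()
```

For example, `run_and_log(["cmake", "--build", ".", "--parallel", "4"], "build.log")` run from the build directory would leave the full output in `build.log`, ready to attach to the issue. Because the command is passed as a list and not through a shell, `$(nproc)` is not expanded; pass a number such as `str(os.cpu_count())` instead.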
Expected behavior
It should build successfully without errors.
Environment
(conda, source): source
(pip, source): source
(python -c "import tvm; print('\n'.join(f'{k}: {v}' for k, v in tvm.support.libinfo().items()))", applicable if you compile models): TVM_GIT_COMMIT_HASH=dc87019cb805d0a1f0075f6415cc979ef337ec2a
python -c "import tvm; print('\n'.join(f'{k}: {v}' for k, v in tvm.support.libinfo().items()))" returned the following error when the build completed.