Closed MrRace closed 4 months ago
Please specify --context-window-size
for Qwen 1.5. BTW I just ran it a few days ago, it works
@Hzfengsy Which version of Qwen1.5 are you specifically using? Qwen1.5-0.5B-Chat? Or Qwen1.5-1.8B-Chat? Or Qwen1.5-4B-Chat?
4B
@Hzfengsy Iam facing the same issue. Is your issue resolved now? "[Qwen1.5-1.8B-Chat ] in mlc-llm, when click the chat entrance in, the model can be loaded normally, but after starting chatting with input text, it gets stuck, and then after a while, the entire application crashes?"
π Bug
Has anyone encountered the situation where using Qwen1.5-4B-Chat and [Qwen1.5-1.8B-Chat ]()in mlc-llm, when click the chat entrance in, the model can be loaded normally, but after starting chatting with input text, it gets stuck, and then after a while, the entire application crashes?
To enable Qwen1.5-1.8B-Chat and Qwen1.5-4B-Chat to run on Android, model format conversion and app compilation were performed using mlc-llm. After installing the app on the phone, upon clicking the chat entry, the Qwen1.5-1.8B-Chat or Qwen1.5-4B-Chat model can be loaded normally. However, upon entering text to start chatting, the app gets stuck and after some time, the entire application crashes. Has anyone encountered this situation?
It is important to note that Qwen1.5-0.5B-Chat can load the model normally and engage in chatting with user input without any issues. The application crashing scenario described above occurs specifically with the Qwen1.5-4B-Chat and Qwen1.5-1.8B-Chat models. Since the application gets stuck or crashes after loading the model, entering text, and clicking send, there are no log messages available for reference.
To Reproduce
Steps to reproduce the behavior:
Step 1: Weight Conversion
Step 2: Generate Configuration Files
Step 3: Model Compilation
This generates the
dist/libs/Qwen1.5-1.8B-Chat-q4f16_1-android.tar
file.Step 4: Modify
app-config.json
FileModify the contents of
./android/library/src/main/assets/app-config.json
as follows:Step 5: Bundle Model Library
Step 6: Build Android App
Expected behavior
Environment
conda
, source): mlc-llm-nightly-cu122=0.1.dev1002pip
, source): build from source, version=0.15.dev154+gc06ec1f24
python -c "import tvm; print('\n'.join(f'{k}: {v}' for k, v in tvm.support.libinfo().items()))"
, applicable if you compile models):Additional context