issues
search
UbiquitousLearning
/
mllm
Fast Multimodal LLM on Mobile Devices
https://ubiquitouslearning.github.io/mllm_website
MIT License
394
stars
48
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
fix: clang-tidy, set line width to 100
#142
chenghuaWang
opened
14 hours ago
0
Compile .so for Android
#141
KAIWEILIUCC
opened
4 days ago
1
The Llama 7B model works on my Android phone, but other models do not.
#140
siz0001
opened
4 days ago
2
I converted custom model, llama-ko-7b, and when play demo get this error message.
#139
siz0001
closed
1 day ago
4
where is bin folder?
#138
siz0001
closed
1 day ago
6
Using Qwen-2.0
#137
KAIWEILIUCC
opened
1 week ago
2
refactor:: remove Layer Class `Split`, replace it with `Tensor::split`
#136
yirongjie
closed
2 weeks ago
0
APk BUild issues
#135
Vinaysukhesh98
opened
2 weeks ago
0
Failed to allocate memory error on Galaxy S24 NPU
#134
gingerly
opened
2 weeks ago
0
运行时出现merge file is broken
#133
mailonghua
closed
2 weeks ago
2
feat: add MiniCPM 2B demo
#132
yirongjie
closed
2 weeks ago
0
关于OpPackage-LLaMAAdd中的Q6_V_valign_VVR计算方式的疑惑
#131
mailonghua
opened
2 weeks ago
0
refactor: `Tensor::run` &`Layer::getFunc`
#130
yirongjie
closed
3 weeks ago
0
fix: +-*/ for old front end
#129
yirongjie
closed
3 weeks ago
0
run “run_qwen_npu.sh “ fail. chip is Snapdragon 870
#128
yangh0597
closed
3 weeks ago
1
fix 修复windows环境
#127
WhiteNight123
closed
3 weeks ago
0
How did you obtain the two model files, qwen-1.5-1.8b-chat-int8.mllm and qwen-1.5-1.8b-chat-q4k.mllm?
#126
yhwang-hub
opened
3 weeks ago
3
./bin/main_qwen_npu fail
#125
zcxo
opened
3 weeks ago
2
./run_qwen-npu.sh failed
#124
zcxo
closed
3 weeks ago
2
./run_qwen_npu.sh失败
#123
zcxo
closed
3 weeks ago
0
perf: CPU Function: +-*/
#122
yirongjie
closed
1 month ago
0
refactor: `Tensor::run` &`Layer::getFunc`: Tensor& -> Tensor
#121
yirongjie
closed
1 month ago
0
refactor: `Layer::run` & `Tensor::getStaticFunc`
#120
yirongjie
closed
1 month ago
0
feat: add Phi-3-mini model
#119
WhiteNight123
closed
1 month ago
0
为什么预填充和解码不能都在 NPU 上运行?
#118
yhwang-hub
opened
1 month ago
4
Segmentation fault on OPPO FindX7 Ultra (Snapdragon8Gen3)
#117
bingo787
opened
1 month ago
1
Prefill speed is approximately 4~6 tokens/s for Qwen1.5-1.8B
#116
mengllm
opened
1 month ago
5
Crash on Xiaomi 14(8gen3) with QNN
#115
zhuipiaochen
opened
1 month ago
1
Android crashed and forcely rebooted when executing main_qwen_npu
#114
taegeonum
opened
1 month ago
9
doc: Update README.md
#113
xumengwei
closed
1 month ago
0
feat: Preliminary implementation on Qualcomm NPU (QNN) backend.
#112
liang1232018
closed
1 month ago
0
feat: llamafile_sgemm bias support
#111
chenghuaWang
closed
1 month ago
0
chore: Disable OpenMP for Mac.
#110
lx200916
closed
1 month ago
0
feat: GEMV + Bias mixed precision support for ARM Devices
#109
chenghuaWang
closed
1 month ago
0
feat: add clear_kvcache && fix: BUG in quantize.
#108
yirongjie
closed
1 month ago
0
fix: Qwen v1.5 Tokenizer bug
#107
chenghuaWang
closed
1 month ago
1
feat: add DEBUGSAVETENSOR & DEBUGOPTIME
#106
yirongjie
closed
1 month ago
0
feat: topk/topp sampling
#105
chenghuaWang
closed
1 month ago
1
perf: add AArch64 GEMM/GEMV for q4_0.
#104
yirongjie
closed
1 month ago
0
What is the difference between ggml and your project?
#103
Subuday
closed
1 month ago
4
Want to contribute. Looking for ideas.
#102
oddlyspaced
closed
1 month ago
2
perf: Use `vector<shared_ptr<Tensor>> Tensor::graphs`
#101
yirongjie
closed
1 month ago
0
feat: add Qwen 1.8B demo
#100
yirongjie
closed
1 month ago
0
feat:Add OPT support
#99
yirongjie
closed
2 months ago
0
feat: add elastic llama
#98
yirongjie
closed
2 months ago
0
Is Subgraph Heterogeneous Compute Available in MLLM?
#97
MaTwickenham
opened
2 months ago
2
[Question] Does this demo support mediatek neuropilot?
#96
AkaneTan
closed
2 months ago
2
doc: Update README.md
#95
yirongjie
closed
2 months ago
0
feat: Stablelm 2 1.6b support
#94
emt0re0
closed
2 months ago
0
Layer not found when trying mllm in Google Colab
#93
gsamaras
closed
2 months ago
9
Next