UbiquitousLearning / mllm

Fast Multimodal LLM on Mobile Devices
https://ubiquitouslearning.github.io/mllm_website
MIT License
537 stars 60 forks source link

feat: Boost xnnpack backend inference speed by freeze tensor weight. #174

Closed chenghuaWang closed 3 weeks ago