mlc-ai / binary-mlc-llm-libs

Resource consumption degradation #97

Open remixer-dec opened 4 months ago

remixer-dec commented 4 months ago

Hi. I try this app from time to time to follow the progress of mobile LLMs. In an earlier version of MLCChat (a6b0a4c, from 19.09.2023), my device with 8GB RAM managed to run a 7B model, but now none of the 7B models available in the app work, even if I free up the RAM completely. The error is CL_OUT_OF_RESOURCES in opencl_device_api_cc:246. Llama prints the error message in the chat. Mistral loads the model successfully, but once generation starts it crashes the app with the same error. Device: Snapdragon 860.