Hi @RaymondWang0 ,
I'm trying to implement this solution on windows(cpu) OS. And the pre-requisites have been met.
Model used is LLaMA2_7B_chat_awq_int4 --QM QM_x86
We are getting the same error for CUDA model too.
I'm getting error with 'make chat -j' as cuda is not available.
Below I've attached the screenshot of error.
Please provide the solution for this error.
Thanks,
Swathi S
Hi @RaymondWang0 , I'm trying to implement this solution on windows(cpu) OS. And the pre-requisites have been met. Model used is LLaMA2_7B_chat_awq_int4 --QM QM_x86 We are getting the same error for CUDA model too. I'm getting error with 'make chat -j' as cuda is not available. Below I've attached the screenshot of error. Please provide the solution for this error. Thanks, Swathi S