Closed OriginalGoku closed 1 year ago
Your GPU is not supported, so the install probably failed
@casper-hansen Thanks for your prompt reply. You are right. When I tried to build the code using git, it gave me the following error message: RuntimeError: GPUs with compute capability less than 7.5 are not supported. but when I tried to install it using pip install i did not get any error messages so initially I thought everything was installed properly.
I followed the instructions to install AutoAWQ
Here is my code: `from transformers import AutoTokenizer from awq import AutoAWQForCausalLM
Load Model and Tokenizer
def load_model_tokenizer(): model_name_or_path = "TheBloke/Mistral-7B-OpenOrca-AWQ"
if name == 'main': model, tokenizer = load_model_tokenizer()
`
and here is the error message:
Traceback (most recent call last): File "C:\Users\User\PycharmProjects\Mistral7B\main.py", line 4, in <module> from awq import AutoAWQForCausalLM File "C:\Users\User\PycharmProjects\Mistral7B\venv\lib\site-packages\awq\__init__.py", line 2, in <module> from awq.models.auto import AutoAWQForCausalLM File "C:\Users\User\PycharmProjects\Mistral7B\venv\lib\site-packages\awq\models\__init__.py", line 1, in <module> from .mpt import MptAWQForCausalLM File "C:\Users\User\PycharmProjects\Mistral7B\venv\lib\site-packages\awq\models\mpt.py", line 1, in <module> from .base import BaseAWQForCausalLM File "C:\Users\User\PycharmProjects\Mistral7B\venv\lib\site-packages\awq\models\base.py", line 11, in <module> from awq.quantize.quantizer import AwqQuantizer File "C:\Users\User\PycharmProjects\Mistral7B\venv\lib\site-packages\awq\quantize\quantizer.py", line 11, in <module> from awq.modules.linear import WQLinear_GEMM, WQLinear_GEMV File "C:\Users\User\PycharmProjects\Mistral7B\venv\lib\site-packages\awq\modules\linear.py", line 4, in <module> import awq_inference_engine # with CUDA kernels ImportError: DLL load failed while importing awq_inference_engine: The specified procedure could not be found.
I am using torch version 2.1.0 and CUDA 11.8 I have an NVIDIA GeForce GTX 1070 with 32GB GPU RAM (8 GB Dedicated and 24GB Shared GPU Ram)