Open sayakpaul opened 2 days ago
I think it might be related to the same issue @SunMarc had: the int4 kernels have not been compiled because one of the devices on your host has a CUDA arch that is lower than sm80. Can you try with the fix I just pushed ?
Might be fixed by #227
No, it doesn't :(
Here's my nvcc -V
:
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2023 NVIDIA Corporation
Built on Fri_Sep__8_19:17:24_PDT_2023
Cuda compilation tools, release 12.3, V12.3.52
Build cuda_12.3.r12.3/compiler.33281558_0
nvidia-smi
:
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.129.03 Driver Version: 535.129.03 CUDA Version: 12.2 |
|-----------------------------------------+----------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+======================+======================|
| 0 NVIDIA A100-SXM4-80GB On | 00000000:01:00.0 Off | 0 |
| N/A 49C P0 92W / 275W | 1690MiB / 81920MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+----------------------+----------------------+
| 1 NVIDIA A100-SXM4-80GB On | 00000000:47:00.0 Off | 0 |
| N/A 50C P0 93W / 275W | 8MiB / 81920MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+----------------------+----------------------+
| 2 NVIDIA A100-SXM4-80GB On | 00000000:81:00.0 Off | 0 |
| N/A 49C P0 94W / 275W | 8MiB / 81920MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+----------------------+----------------------+
| 3 NVIDIA DGX Display On | 00000000:C1:00.0 Off | N/A |
| 34% 37C P8 N/A / 50W | 3MiB / 4096MiB | 0% Default |
| | | N/A |
+-----------------------------------------+----------------------+----------------------+
| 4 NVIDIA A100-SXM4-80GB On | 00000000:C2:00.0 Off | 0 |
| N/A 50C P0 98W / 275W | 8MiB / 81920MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+----------------------+----------------------+
+---------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=======================================================================================|
| 0 N/A N/A 836338 C ...iniconda3/envs/parlertts/bin/python 1676MiB |
+---------------------------------------------------------------------------------------+
@sayakpaul can you try uninstalling then installing optimum-quanto
, just to make sure there is no obsolete cached extension ?
Yeah did that too but still failing @dacorvo
Install
diffusers
first.And then do:
I am on the HF DGX. My PyTorch version is 2.3.1. I installed
quanto
frommain
.Getting:
Cc: @dacorvo @SunMarc