Closed poonehmousavi closed 1 year ago
Could you please provide the installation command you used? Please note that CUDA and CPP modules (including fused_layer_norm_cuda) in apex are only installed when explicitly specified in the command arguments (see https://github.com/nvidia/apex#linux).
I have used these commands inorder to install: git clone https://github.com/NVIDIA/apex cd apex pip install -v --disable-pip-version-check --no-cache-dir --no-build-isolation --global-option="--cpp_ext" --global-option="--cuda_ext" ./
I faced the same error. This seems to be a CUDA compatibility issue. To resolve it, you can try the following steps.
export PATH=/usr/local/cuda-11.7/bin/${PATH:+:${PATH}} #should put the cuda path as installed in your device
export LD_LIBRARY_PATH=/usr/local/cuda-11.7/lib64${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}
export CUDA_HOME=/usr/local/cuda-11.7
You can double-check by running:
echo $PATH
echo $LD_LIBRARY_PATH
echo $CUDA_HOME
# if pip >= 23.1 (ref: https://pip.pypa.io/en/stable/news/#v23-1) which supports multiple `--config-settings` with the same key...
pip install -v --disable-pip-version-check --no-cache-dir --no-build-isolation --config-settings "--build-option=--cpp_ext" --config-settings "--build-option=--cuda_ext" ./
I faced the same error when I used 'pip', then I use python setup.py install --cuda_ext --cpp_ext
, this work for me
I have installed apex but when running the fine-tuning, I got this error: NVIDIA-SMI 510.47.03 Driver Version: 510.47.03 CUDA Version: 11.6