Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Undefined symbol #723

jonathanasdf opened 9 months ago

jonathanasdf commented 9 months ago
RuntimeError: Failed to import transformers.models.llama.modeling_llama because of the following error (look up to see its traceback):
/opt/venv/lib/python3.11/site-packages/flash_attn_2_cuda.cpython-311-x86_64-linux-gnu.so: undefined symbol: _ZN2at4_ops9_pad_enum4callERKNS_6TensorEN3c108ArrayRefINS5_6SymIntEEElNS5_8optionalIdEE

I'm using CUDA 12.1, PyTorch nightly 2.2.0+cu121, and flash-attention built from source (pip install git+https://github.com/Dao-AILab/flash-attention.git@92dd570 --no-build-isolation). Any idea what I can do to debug this?

kandeldeepak46 commented 9 months ago

Getting the same error.

sleeper1023 commented 9 months ago

Me too. Can anyone solve this problem?

tcapelle commented 9 months ago


tridao commented 9 months ago

It's because of a torch version change. nvcr pytorch 23.12 should work with flash-attn v2.4.0.post1 now. If you're using torch-nightly, we currently compile the CUDA wheel against torch-nightly 20231106, so if your torch-nightly version is close to that, it should work.
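
Since the failure depends on which torch build the extension was compiled against, a first step is to record the installed versions next to the import error. The following is a minimal sketch, not part of flash-attn: the helper names (installed_version, import_error, report) are hypothetical, and it assumes the distributions are registered under the names "torch" and "flash-attn".

```python
import importlib
from importlib import metadata


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def import_error(module_name):
    """Try to import a module; return the error text, or None on success."""
    try:
        importlib.import_module(module_name)
    except ImportError as exc:  # an "undefined symbol" failure surfaces as ImportError
        return str(exc)
    return None


def report():
    """Collect installed versions and the flash_attn import result as text lines."""
    lines = []
    for dist in ("torch", "flash-attn"):  # assumed distribution names
        lines.append(f"{dist}: {installed_version(dist) or 'not installed'}")
    # Import the package rather than the bare .so so that torch is loaded first.
    err = import_error("flash_attn")
    lines.append(f"flash_attn import: {err or 'ok'}")
    return lines


if __name__ == "__main__":
    print("\n".join(report()))
```

Posting this output together with the install command makes it easier to see whether the installed torch differs from the one the wheel was built against.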

tianyunzqs commented 9 months ago

I installed flash-attn v2.4.2 successfully with python3.10+cuda12.1+torch2.1.0+transformers4.36.2
Here is my environment.

Eikor commented 8 months ago

same problem on torch 2.1.0+cuda 12.1: ImportError: /usr/local/lib/python3.8/dist-packages/flash_attn_2_cuda.cpython-38-x86_64-linux-gnu.so: undefined symbol: _ZN3c104cuda9SetDeviceEi
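
The mangled names in these errors are easier to read once demangled, which shows which torch function the extension expects to find. Below is a small sketch that shells out to c++filt (from GNU binutils or LLVM), assuming it is on PATH. The demangle helper is hypothetical and returns the symbol unchanged when the tool is missing.

```python
import shutil
import subprocess


def demangle(symbol):
    """Demangle a C++ symbol with c++filt; return it unchanged if the tool is unavailable."""
    tool = shutil.which("c++filt")
    if tool is None:
        return symbol
    result = subprocess.run(
        [tool, symbol], capture_output=True, text=True, check=False
    )
    # Fall back to the raw symbol if c++filt printed nothing.
    return result.stdout.strip() or symbol


if __name__ == "__main__":
    # Symbol copied from the ImportError above.
    print(demangle("_ZN3c104cuda9SetDeviceEi"))
```

If the demangled function lives in the at:: or c10:: namespaces, the missing symbol belongs to torch itself, which points to a torch/extension version mismatch rather than a CUDA toolkit problem.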

ArlanCooper commented 8 months ago

same, torch2.2.0+cuda11.8: undefined symbol: _ZN2at4_ops9_pad_enum4callERKNS_6TensorEN3c108ArrayRefINS5_6SymIntEEElNS5_8optionalIdEE

pseudotensor commented 8 months ago
Python 3.10.13 | packaged by conda-forge | (main, Dec 23 2023, 15:36:39) [GCC 12.3.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import flash_attn
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/flash_attn/__init__.py", line 3, in <module>
    from flash_attn.flash_attn_interface import (
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/flash_attn/flash_attn_interface.py", line 8, in <module>
    import flash_attn_2_cuda as flash_attn_cuda
ImportError: /home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/flash_attn_2_cuda.cpython-310-x86_64-linux-gnu.so: undefined symbol: _ZN2at4_ops9_pad_enum4callERKNS_6TensorEN3c108ArrayRefINS5_6SymIntEEElNS5_8optionalIdEE

I started getting this and have no idea why. Nothing mentioned above works. I tried CUDA 11.8 and 12.1 with a consistent install each time, and it never works.

    export CUDA_HOME=/usr/local/cuda-12.1
    export PIP_EXTRA_INDEX_URL="https://download.pytorch.org/whl/cu121"
    pip install --upgrade pip
    pip install flash-attn==2.4.2 --no-build-isolation --no-cache-dir

pseudotensor commented 8 months ago

Same issue with the latest version of flash-attn:

(h2ogpt) jon@gpu:~$ python
Python 3.10.13 | packaged by conda-forge | (main, Dec 23 2023, 15:36:39) [GCC 12.3.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import flash_attn
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/flash_attn/__init__.py", line 3, in <module>
    from flash_attn.flash_attn_interface import (
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/flash_attn/flash_attn_interface.py", line 10, in <module>
    import flash_attn_2_cuda as flash_attn_cuda
ImportError: /home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/flash_attn_2_cuda.cpython-310-x86_64-linux-gnu.so: undefined symbol: _ZN2at4_ops5zeros4callEN3c108ArrayRefINS2_6SymIntEEENS2_8optionalINS2_10ScalarTypeEEENS6_INS2_6LayoutEEENS6_INS2_6DeviceEEENS6_IbEE

pseudotensor commented 8 months ago

Ah, it seems to be an incompatibility between torch 2.2.0 and flash_attn.

caoxu915683474 commented 6 months ago

I have the same issue. Has anyone solved it?

robinsonmd commented 5 months ago

This combination works: cuda 12.1, torch 2.2.0+cu121, flash-attn 2.5.7.
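
For anyone comparing their setup against the reports in this thread, here is a rough sketch that checks a torch/flash-attn pair against the combinations people said worked. The table is anecdotal (taken only from comments above, not an official compatibility matrix), and the names REPORTED_WORKING, base_version, and matches_reported are hypothetical.

```python
# (torch, flash-attn) pairs reported as working in this thread; anecdotal only.
REPORTED_WORKING = {
    ("2.1.0", "2.4.2"),
    ("2.2.0", "2.5.7"),
}


def base_version(version):
    """Drop a local build suffix such as '+cu121' from a version string."""
    return version.split("+", 1)[0]


def matches_reported(torch_version, flash_attn_version):
    """Return True if the pair matches a combination reported as working above."""
    pair = (base_version(torch_version), base_version(flash_attn_version))
    return pair in REPORTED_WORKING


if __name__ == "__main__":
    print(matches_reported("2.2.0+cu121", "2.5.7"))
    print(matches_reported("2.2.0+cu121", "2.4.2"))
```

A pair missing from the table is not necessarily broken. It only means nobody in this thread reported it as working.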