A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Traceback (most recent call last):
File "setup.py", line 13, in
from build_tools.build_ext import CMakeExtension, get_build_ext
File "/w/github/TransformerEngine/build_tools/build_ext.py", line 20, in
from .utils import (
File "/w/github/TransformerEngine/build_tools/utils.py", line 13, in
from functools import cache
ImportError: cannot import name 'cache' from 'functools' (/opt/conda/lib/python3.8/functools.py)
This is due to /build_tools/utils.py: from functools import cache
which is not a Python 3.8 feature.
CUDACXX=/usr/local/cuda/bin/nvcc NVTE_FRAMEWORK=pytorch python setup.py bdist_wheel
Traceback (most recent call last): File "setup.py", line 13, in
from build_tools.build_ext import CMakeExtension, get_build_ext
File "/w/github/TransformerEngine/build_tools/build_ext.py", line 20, in
from .utils import (
File "/w/github/TransformerEngine/build_tools/utils.py", line 13, in
from functools import cache
ImportError: cannot import name 'cache' from 'functools' (/opt/conda/lib/python3.8/functools.py)