NVIDIA / TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Apache License 2.0

1.6k stars 255 forks source link

This is due to /build_tools/utils.py: from functools import cache

which is not a Python 3.8 feature.

CUDACXX=/usr/local/cuda/bin/nvcc NVTE_FRAMEWORK=pytorch python setup.py bdist_wheel

Traceback (most recent call last): File "setup.py", line 13, in from build_tools.build_ext import CMakeExtension, get_build_ext File "/w/github/TransformerEngine/build_tools/build_ext.py", line 20, in from .utils import ( File "/w/github/TransformerEngine/build_tools/utils.py", line 13, in from functools import cache ImportError: cannot import name 'cache' from 'functools' (/opt/conda/lib/python3.8/functools.py)

NVIDIA / TransformerEngine

TransformerEngine setup.py fails with Python 3.8 #958

CUDACXX=/usr/local/cuda/bin/nvcc NVTE_FRAMEWORK=pytorch python setup.py bdist_wheel