NVIDIA / Fuser

A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

Installation

We publish nightly wheel packages on https://pypi.nvidia.com, and wheels built against the stable torch release on https://pypi.org. Wheels are published for Python versions 3.10 and 3.12.

built-env                        cuda 11.8              cuda 12.1              cuda 12.4
torch 2.5 (pypi.org)             nvfuser-cu118-torch25  nvfuser-cu121-torch25  nvfuser-cu124-torch25
torch nightly (pypi.nvidia.com)  nvfuser-cu118          nvfuser-cu121          nvfuser-cu124

Note that an nvfuser wheel built against a stable torch release isn't compatible with nightly pytorch wheels, so make sure you pick the package that matches your environment.
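As a rough illustration of how the table above can be read, the following sketch maps a torch channel and CUDA version to a wheel name. The dictionary only restates the table; the helper name `pick_wheel` and the channel labels "stable" and "nightly" are assumptions made for this example and are not part of nvfuser.

```python
# Package names copied from the table above; keys are (torch channel, CUDA version).
# The channel labels "stable" and "nightly" are labels chosen for this sketch.
NVFUSER_WHEELS = {
    ("stable", "11.8"): "nvfuser-cu118-torch25",
    ("stable", "12.1"): "nvfuser-cu121-torch25",
    ("stable", "12.4"): "nvfuser-cu124-torch25",
    ("nightly", "11.8"): "nvfuser-cu118",
    ("nightly", "12.1"): "nvfuser-cu121",
    ("nightly", "12.4"): "nvfuser-cu124",
}


def pick_wheel(channel: str, cuda: str) -> str:
    """Return the wheel name for a torch channel and CUDA version (hypothetical helper)."""
    try:
        return NVFUSER_WHEELS[(channel, cuda)]
    except KeyError:
        # Anything outside the table is treated as unsupported rather than guessed.
        raise ValueError(
            f"no nvfuser wheel listed for torch {channel} with CUDA {cuda}"
        ) from None


print(pick_wheel("nightly", "12.1"))  # nvfuser-cu121
```

Failing loudly on combinations missing from the table avoids installing a wheel that was never published for that environment.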

nightly nvfuser pip wheel

You can install a nightly nvfuser pip package built against the torch nightly code base with pip install --pre nvfuser-cu121 --extra-index-url https://pypi.nvidia.com

Because we build against the nightly torch wheel and nightly wheels come with no compatibility promise, we have explicitly marked the nightly torch wheel as an optional dependency. You can choose to install the torch wheel along with the nvfuser package, e.g. pip install --pre "nvfuser-cu121[torch]" --extra-index-url https://pypi.nvidia.com. Note that this may uninstall your local pytorch installation and install the compatible nightly pytorch.
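For scripted setups, the two nightly commands above can be assembled programmatically. This is a minimal sketch: the helper `nightly_install_command` and its parameters are hypothetical, and only the pip arguments themselves come from the commands quoted above.

```python
import sys


def nightly_install_command(package: str = "nvfuser-cu121", with_torch: bool = False) -> list[str]:
    """Build the pip argv for a nightly nvfuser install (hypothetical helper)."""
    # The [torch] extra asks pip to also install the matching nightly torch wheel.
    requirement = f"{package}[torch]" if with_torch else package
    return [
        sys.executable, "-m", "pip", "install", "--pre", requirement,
        "--extra-index-url", "https://pypi.nvidia.com",
    ]


# Show the command without running it; skip the interpreter path for readability.
print(" ".join(nightly_install_command(with_torch=True)[1:]))
```

The returned list can be passed to subprocess.run(cmd, check=True). Keeping with_torch off by default leaves an existing pytorch installation untouched unless you opt in.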

nvfuser pip wheel against pytorch stable release

nvfuser pip wheels built against stable torch releases are published on pypi.org. Pick the package whose cuda toolkit version matches your torch installation, e.g. pip install nvfuser-cu121-torch25
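One way to match the package to an existing torch installation is to derive the name from the versions that torch reports. The sketch below assumes the naming pattern nvfuser-cu<cuda>-torch<major><minor>, which is inferred from the package names listed in this README and is not a documented guarantee. The helper `stable_wheel_name` is hypothetical, and a derived name may not correspond to a published wheel.

```python
def stable_wheel_name(torch_version: str, cuda_version: str) -> str:
    """Derive a stable-release wheel name from version strings (hypothetical helper).

    The naming pattern is an assumption inferred from the README's package names.
    """
    if ".dev" in torch_version:
        # Nightly torch builds carry a .dev tag and need the nightly nvfuser wheel.
        raise ValueError("nightly torch detected; use the nightly nvfuser wheel instead")
    # "2.5.1+cu121" -> major "2", minor "5"
    major, minor = torch_version.split(".")[:2]
    # "12.1" -> "121"
    cuda_tag = "".join(cuda_version.split(".")[:2])
    return f"nvfuser-cu{cuda_tag}-torch{major}{minor}"


print(stable_wheel_name("2.5.1+cu121", "12.1"))  # nvfuser-cu121-torch25
```

With torch installed, the two arguments could come from torch.__version__ and torch.version.cuda. Check the result against the packages actually listed on pypi.org before installing.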

PyPI: https://pypi.org/project/nvfuser/

Developer

Docs: https://github.com/NVIDIA/Fuser/wiki

Supported compilers:

GCC:

We support all "supported releases" of gcc as specified on the official site. As of 6/20/2024, they are:

Clang:

Supported C++ standard:

We are actively considering dropping C++17 support.