NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
https://nvidia.github.io/TensorRT-LLM
Apache License 2.0

installation failure of package 0.8.0 #1236

Closed (pfldy2850 closed this issue 5 months ago)

pfldy2850 commented 6 months ago

System Info

Who can help?

@byshiue

Reproduction

I followed the official installation guide and ran the command as shown below.

# apt-get update && apt-get -y install python3.10 python3-pip openmpi-bin libopenmpi-dev python-is-python3

# python --version
Python 3.10.12

# pip3 install tensorrt_llm -U --extra-index-url https://pypi.nvidia.com
Looking in indexes: https://pypi.org/simple, https://pypi.nvidia.com
WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /tensorrt-llm/
WARNING: Retrying (Retry(total=3, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /tensorrt-llm/
WARNING: Retrying (Retry(total=2, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /tensorrt-llm/
WARNING: Retrying (Retry(total=1, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /tensorrt-llm/
WARNING: Retrying (Retry(total=0, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /tensorrt-llm/
Collecting tensorrt_llm
  Downloading tensorrt-llm-0.8.0.tar.gz (6.9 kB)
  Preparing metadata (setup.py) ... error
  error: subprocess-exited-with-error

  × python setup.py egg_info did not run successfully.
  │ exit code: 1
  ╰─> [6 lines of output]
      Traceback (most recent call last):
        File "<string>", line 2, in <module>
        File "<pip-setuptools-caller>", line 34, in <module>
        File "/tmp/pip-install-qyd8c9h_/tensorrt-llm_674b90d9f8ea4f579dfca40c2c98e94c/setup.py", line 90, in <module>
          raise RuntimeError("Bad params")
      RuntimeError: Bad params
      [end of output]

  note: This error originates from a subprocess, and is likely not a problem with pip.
error: metadata-generation-failed

× Encountered error while generating package metadata.
╰─> See above for output.

note: This is an issue with the package mentioned above, not pip.
hint: See above for details.

[notice] A new release of pip is available: 23.3.1 -> 24.0
[notice] To update, run: python3 -m pip install --upgrade pip
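
One possible reading of the log above: the five retry warnings all end in a connection reset on the /tensorrt-llm/ path, and pip then downloads a 6.9 kB source archive instead of a prebuilt package. A minimal sketch, assuming the resets come from one of the two indexes pip lists, for checking from inside the container whether an index URL answers at all. The function name `index_reachable` and its `opener` parameter are hypothetical helpers for illustration, not part of pip or TensorRT-LLM.

```python
import urllib.request


def index_reachable(url, timeout=10.0, opener=urllib.request.urlopen):
    """Return True if a GET on `url` completes with an HTTP status below 400.

    `opener` is injectable (an assumption made for testing) so the check can
    be exercised without network access.
    """
    try:
        with opener(url, timeout=timeout) as response:
            return response.status < 400
    except OSError:
        # URLError, HTTPError, ConnectionResetError (errno 104), TLS errors
        # and timeouts are all subclasses of OSError.
        return False


# Example call (not executed here because it needs network access):
#   index_reachable("https://pypi.nvidia.com/tensorrt-llm/")
```

If the call returns False for https://pypi.nvidia.com/tensorrt-llm/ but True for the pypi.org index, the failure would be a network problem between the container and the NVIDIA index rather than a packaging problem.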

Expected behavior

The installation succeeds.

actual behavior

The installation fails with the RuntimeError shown above.

additional notes

I used the official image, nvcr.io/nvidia/tritonserver:24.02-trtllm-python-py3, and the Python version is 3.10.

What I found is that pip downloads tensorrt-llm-0.8.0.tar.gz from PyPI. The file name carries no platform or Python-version suffix, so it is a source archive rather than a prebuilt wheel.
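
To make the file name difference concrete, here is a minimal sketch that splits a distribution file name into its fields. It assumes the standard wheel naming layout (name-version[-build]-python-abi-platform.whl). The function `describe_dist_filename` is a hypothetical helper written for this illustration. The wheel name used below is the one pip downloads in the successful install log later in this thread.

```python
def describe_dist_filename(filename):
    """Classify a distribution file name as a wheel or a source archive."""
    if filename.endswith(".whl"):
        parts = filename[: -len(".whl")].split("-")
        # A wheel name has 5 fields, or 6 when an optional build tag is present.
        if len(parts) not in (5, 6):
            raise ValueError(f"not a valid wheel file name: {filename}")
        python_tag, abi_tag, platform_tag = parts[-3:]
        return {
            "kind": "wheel",
            "name": parts[0],
            "version": parts[1],
            "python": python_tag,
            "abi": abi_tag,
            "platform": platform_tag,
        }
    for suffix in (".tar.gz", ".zip"):
        if filename.endswith(suffix):
            # Source archives only encode name and version.
            name, _, version = filename[: -len(suffix)].rpartition("-")
            return {"kind": "sdist", "name": name, "version": version}
    raise ValueError(f"unrecognized distribution file name: {filename}")


for candidate in (
    "tensorrt-llm-0.8.0.tar.gz",
    "tensorrt_llm-0.8.0-cp310-cp310-linux_x86_64.whl",
):
    print(describe_dist_filename(candidate))
```

The first name yields only a name and version, while the second also carries the cp310 and linux_x86_64 tags. As a possible follow-up, pip's `--only-binary=:all:` option tells pip to ignore source archives, which should turn a silent fallback like this into an explicit "no matching distribution" error.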

jonny2027 commented 6 months ago

I get the exact same error when trying to install ammo:

pip download --extra-index-url https://pypi.nvidia.com nvidia-ammo

byshiue commented 6 months ago

I tried, but I cannot reproduce your issue.

bhsueh@xxx:/home/scratch.bhsueh_sw_1$ nvidia-docker run -ti --gpus all --shm-size 25g nvcr.io/nvidia/tritonserver:24.02-trtllm-python-py3 bash

=============================
== Triton Inference Server ==
=============================

NVIDIA Release 24.02 (build 83572707)
Triton Server Version 2.43.0

Copyright (c) 2018-2023, NVIDIA CORPORATION & AFFILIATES.  All rights reserved.

Various files include modifications (c) NVIDIA CORPORATION & AFFILIATES.  All rights reserved.

This container image and its contents are governed by the NVIDIA Deep Learning Container License.
By pulling and using the container, you accept the terms and conditions of this license:
https://developer.nvidia.com/ngc/nvidia-deep-learning-container-license

NOTE: CUDA Forward Compatibility mode ENABLED.
  Using CUDA 12.3 driver version 545.23.08 with kernel driver version 535.129.03.
  See https://docs.nvidia.com/deploy/cuda-compatibility/ for details.
Installing dependencies and tensorrt_llm
```bash
root@6d80b0d9f83a:/opt/tritonserver# apt-get update && apt-get -y install python3.10 python3-pip openmpi-bin libopenmpi-dev python-is-python3
Get:1 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 InRelease [1581 B]
Get:2 http://security.ubuntu.com/ubuntu jammy-security InRelease [110 kB]
Get:3 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages [736 kB]
Get:4 http://archive.ubuntu.com/ubuntu jammy InRelease [270 kB]
Get:5 http://security.ubuntu.com/ubuntu jammy-security/main amd64 Packages [1533 kB]
Get:6 http://security.ubuntu.com/ubuntu jammy-security/multiverse amd64 Packages [44.6 kB]
Get:7 http://security.ubuntu.com/ubuntu jammy-security/universe amd64 Packages [1076 kB]
Get:8 http://security.ubuntu.com/ubuntu jammy-security/restricted amd64 Packages [1914 kB]
Get:9 http://archive.ubuntu.com/ubuntu jammy-updates InRelease [119 kB]
Get:10 http://archive.ubuntu.com/ubuntu jammy-backports InRelease [109 kB]
Get:11 http://archive.ubuntu.com/ubuntu jammy/universe amd64 Packages [17.5 MB]
Get:12 http://archive.ubuntu.com/ubuntu jammy/restricted amd64 Packages [164 kB]
Get:13 http://archive.ubuntu.com/ubuntu jammy/multiverse amd64 Packages [266 kB]
Get:14 http://archive.ubuntu.com/ubuntu jammy/main amd64 Packages [1792 kB]
Get:15 http://archive.ubuntu.com/ubuntu jammy-updates/restricted amd64 Packages [1952 kB]
Get:16 http://archive.ubuntu.com/ubuntu jammy-updates/universe amd64 Packages [1347 kB]
Get:17 http://archive.ubuntu.com/ubuntu jammy-updates/multiverse amd64 Packages [50.4 kB]
Get:18 http://archive.ubuntu.com/ubuntu jammy-updates/main amd64 Packages [1812 kB]
Get:19 http://archive.ubuntu.com/ubuntu jammy-backports/universe amd64 Packages [28.1 kB]
Get:20 http://archive.ubuntu.com/ubuntu jammy-backports/main amd64 Packages [50.4 kB]
Fetched 30.8 MB in 4s (8287 kB/s)
Reading package lists... Done
Reading package lists... Done
Building dependency tree...
Done Reading state information... Done python3.10 is already the newest version (3.10.12-1~22.04.3). python3.10 set to manually installed. python3-pip is already the newest version (22.0.2+dfsg-1ubuntu0.4). The following additional packages will be installed: autoconf automake autotools-dev file gfortran gfortran-11 javascript-common libcaf-openmpi-3 libcoarrays-dev libcoarrays-openmpi-dev libevent-2.1-7 libevent-core-2.1-7 libevent-dev libevent-extra-2.1-7 libevent-openssl-2.1-7 libevent-pthreads-2.1-7 libfabric1 libgfortran-11-dev libgfortran5 libhwloc-dev libhwloc-plugins libhwloc15 libjs-jquery libjs-jquery-ui libltdl-dev libltdl7 libmagic-mgc libmagic1 libopenmpi3 libpciaccess0 libpmix-dev libpmix2 libpsm-infinipath1 libpsm2-2 libsigsegv2 libtool libucx0 libx11-6 libx11-data libxau6 libxcb1 libxdmcp6 libxext6 libxnvctrl0 m4 ocl-icd-libopencl1 openmpi-common Suggested packages: autoconf-archive gnu-standards autoconf-doc gettext gfortran-multilib gfortran-doc gfortran-11-multilib gfortran-11-doc apache2 | lighttpd | httpd libhwloc-contrib-plugins libjs-jquery-ui-docs libtool-doc openmpi-doc pciutils gcj-jdk m4-doc opencl-icd The following NEW packages will be installed: autoconf automake autotools-dev file gfortran gfortran-11 javascript-common libcaf-openmpi-3 libcoarrays-dev libcoarrays-openmpi-dev libevent-2.1-7 libevent-core-2.1-7 libevent-dev libevent-extra-2.1-7 libevent-openssl-2.1-7 libevent-pthreads-2.1-7 libfabric1 libgfortran-11-dev libgfortran5 libhwloc-dev libhwloc-plugins libhwloc15 libjs-jquery libjs-jquery-ui libltdl-dev libltdl7 libmagic-mgc libmagic1 libopenmpi-dev libopenmpi3 libpciaccess0 libpmix-dev libpmix2 libpsm-infinipath1 libpsm2-2 libsigsegv2 libtool libucx0 libx11-6 libx11-data libxau6 libxcb1 libxdmcp6 libxext6 libxnvctrl0 m4 ocl-icd-libopencl1 openmpi-bin openmpi-common python-is-python3 0 upgraded, 50 newly installed, 0 to remove and 35 not upgraded. Need to get 24.9 MB of archives. 
After this operation, 104 MB of additional disk space will be used. Get:1 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 libxnvctrl0 550.54.14-0ubuntu1 [21.3 kB] Get:2 http://archive.ubuntu.com/ubuntu jammy-updates/main amd64 libmagic-mgc amd64 1:5.41-3ubuntu0.1 [257 kB] Get:3 http://archive.ubuntu.com/ubuntu jammy-updates/main amd64 libmagic1 amd64 1:5.41-3ubuntu0.1 [87.2 kB] Get:4 http://archive.ubuntu.com/ubuntu jammy-updates/main amd64 file amd64 1:5.41-3ubuntu0.1 [21.5 kB] Get:5 http://archive.ubuntu.com/ubuntu jammy/main amd64 libxau6 amd64 1:1.0.9-1build5 [7634 B] Get:6 http://archive.ubuntu.com/ubuntu jammy/main amd64 libxdmcp6 amd64 1:1.1.3-0ubuntu5 [10.9 kB] Get:7 http://archive.ubuntu.com/ubuntu jammy/main amd64 libxcb1 amd64 1.14-3ubuntu3 [49.0 kB] Get:8 http://archive.ubuntu.com/ubuntu jammy-updates/main amd64 libx11-data all 2:1.7.5-1ubuntu0.3 [120 kB] Get:9 http://archive.ubuntu.com/ubuntu jammy-updates/main amd64 libx11-6 amd64 2:1.7.5-1ubuntu0.3 [667 kB] Get:10 http://archive.ubuntu.com/ubuntu jammy/main amd64 libxext6 amd64 2:1.3.4-1build1 [31.8 kB] Get:11 http://archive.ubuntu.com/ubuntu jammy/main amd64 libsigsegv2 amd64 2.13-1ubuntu3 [14.6 kB] Get:12 http://archive.ubuntu.com/ubuntu jammy/main amd64 m4 amd64 1.4.18-5ubuntu2 [199 kB] Get:13 http://archive.ubuntu.com/ubuntu jammy/main amd64 autoconf all 2.71-2 [338 kB] Get:14 http://archive.ubuntu.com/ubuntu jammy/main amd64 autotools-dev all 20220109.1 [44.9 kB] Get:15 http://archive.ubuntu.com/ubuntu jammy/main amd64 automake all 1:1.16.5-1.3 [558 kB] Get:16 http://archive.ubuntu.com/ubuntu jammy-updates/main amd64 libgfortran5 amd64 12.3.0-1ubuntu1~22.04 [879 kB] Get:17 http://archive.ubuntu.com/ubuntu jammy-updates/main amd64 libgfortran-11-dev amd64 11.4.0-1ubuntu1~22.04 [842 kB] Get:18 http://archive.ubuntu.com/ubuntu jammy-updates/main amd64 gfortran-11 amd64 11.4.0-1ubuntu1~22.04 [11.2 MB] Get:19 http://archive.ubuntu.com/ubuntu jammy/main amd64 gfortran amd64 
4:11.2.0-1ubuntu1 [1182 B] Get:20 http://archive.ubuntu.com/ubuntu jammy/main amd64 javascript-common all 11+nmu1 [5936 B] Get:21 http://archive.ubuntu.com/ubuntu jammy/main amd64 libevent-core-2.1-7 amd64 2.1.12-stable-1build3 [93.9 kB] Get:22 http://archive.ubuntu.com/ubuntu jammy/main amd64 libevent-pthreads-2.1-7 amd64 2.1.12-stable-1build3 [7642 B] Get:23 http://archive.ubuntu.com/ubuntu jammy/universe amd64 libpsm-infinipath1 amd64 3.3+20.604758e7-6.1 [170 kB] Get:24 http://archive.ubuntu.com/ubuntu jammy/universe amd64 libpsm2-2 amd64 11.2.185-1 [182 kB] Get:25 http://archive.ubuntu.com/ubuntu jammy/universe amd64 libfabric1 amd64 1.11.0-3 [558 kB] Get:26 http://archive.ubuntu.com/ubuntu jammy-updates/universe amd64 libhwloc15 amd64 2.7.0-2ubuntu1 [159 kB] Get:27 http://archive.ubuntu.com/ubuntu jammy/main amd64 libpciaccess0 amd64 0.16-3 [19.1 kB] Get:28 http://archive.ubuntu.com/ubuntu jammy/universe amd64 ocl-icd-libopencl1 amd64 2.2.14-3 [39.1 kB] Get:29 http://archive.ubuntu.com/ubuntu jammy-updates/universe amd64 libhwloc-plugins amd64 2.7.0-2ubuntu1 [15.6 kB] Get:30 http://archive.ubuntu.com/ubuntu jammy/universe amd64 libpmix2 amd64 4.1.2-2ubuntu1 [604 kB] Get:31 http://archive.ubuntu.com/ubuntu jammy/universe amd64 libucx0 amd64 1.12.1~rc2-1 [891 kB] Get:32 http://archive.ubuntu.com/ubuntu jammy/universe amd64 libopenmpi3 amd64 4.1.2-2ubuntu1 [2594 kB] Get:33 http://archive.ubuntu.com/ubuntu jammy/universe amd64 libcaf-openmpi-3 amd64 2.9.2-3 [36.5 kB] Get:34 http://archive.ubuntu.com/ubuntu jammy/universe amd64 libcoarrays-dev amd64 2.9.2-3 [40.5 kB] Get:35 http://archive.ubuntu.com/ubuntu jammy/universe amd64 openmpi-common all 4.1.2-2ubuntu1 [162 kB] Get:36 http://archive.ubuntu.com/ubuntu jammy/universe amd64 openmpi-bin amd64 4.1.2-2ubuntu1 [116 kB] Get:37 http://archive.ubuntu.com/ubuntu jammy/universe amd64 libcoarrays-openmpi-dev amd64 2.9.2-3 [452 kB] Get:38 http://archive.ubuntu.com/ubuntu jammy/main amd64 libevent-2.1-7 amd64 
2.1.12-stable-1build3 [148 kB] Get:39 http://archive.ubuntu.com/ubuntu jammy/main amd64 libevent-extra-2.1-7 amd64 2.1.12-stable-1build3 [65.4 kB] Get:40 http://archive.ubuntu.com/ubuntu jammy/main amd64 libevent-openssl-2.1-7 amd64 2.1.12-stable-1build3 [15.8 kB] Get:41 http://archive.ubuntu.com/ubuntu jammy/main amd64 libevent-dev amd64 2.1.12-stable-1build3 [278 kB] Get:42 http://archive.ubuntu.com/ubuntu jammy/main amd64 libjs-jquery all 3.6.0+dfsg+~3.5.13-1 [321 kB] Get:43 http://archive.ubuntu.com/ubuntu jammy/universe amd64 libjs-jquery-ui all 1.13.1+dfsg-1 [253 kB] Get:44 http://archive.ubuntu.com/ubuntu jammy/main amd64 libltdl7 amd64 2.4.6-15build2 [39.6 kB] Get:45 http://archive.ubuntu.com/ubuntu jammy/main amd64 libltdl-dev amd64 2.4.6-15build2 [169 kB] Get:46 http://archive.ubuntu.com/ubuntu jammy-updates/universe amd64 libhwloc-dev amd64 2.7.0-2ubuntu1 [256 kB] Get:47 http://archive.ubuntu.com/ubuntu jammy/universe amd64 libpmix-dev amd64 4.1.2-2ubuntu1 [805 kB] Get:48 http://archive.ubuntu.com/ubuntu jammy/main amd64 libtool all 2.4.6-15build2 [164 kB] Get:49 http://archive.ubuntu.com/ubuntu jammy/main amd64 python-is-python3 all 3.9.2-2 [2788 B] Get:50 http://archive.ubuntu.com/ubuntu jammy/universe amd64 libopenmpi-dev amd64 4.1.2-2ubuntu1 [867 kB] Fetched 24.9 MB in 5s (5274 kB/s) Extracting templates from packages: 100% Selecting previously unselected package libmagic-mgc. (Reading database ... 26673 files and directories currently installed.) Preparing to unpack .../00-libmagic-mgc_1%3a5.41-3ubuntu0.1_amd64.deb ... Unpacking libmagic-mgc (1:5.41-3ubuntu0.1) ... Selecting previously unselected package libmagic1:amd64. Preparing to unpack .../01-libmagic1_1%3a5.41-3ubuntu0.1_amd64.deb ... Unpacking libmagic1:amd64 (1:5.41-3ubuntu0.1) ... Selecting previously unselected package file. Preparing to unpack .../02-file_1%3a5.41-3ubuntu0.1_amd64.deb ... Unpacking file (1:5.41-3ubuntu0.1) ... Selecting previously unselected package libxau6:amd64. 
Preparing to unpack .../03-libxau6_1%3a1.0.9-1build5_amd64.deb ... Unpacking libxau6:amd64 (1:1.0.9-1build5) ... Selecting previously unselected package libxdmcp6:amd64. Preparing to unpack .../04-libxdmcp6_1%3a1.1.3-0ubuntu5_amd64.deb ... Unpacking libxdmcp6:amd64 (1:1.1.3-0ubuntu5) ... Selecting previously unselected package libxcb1:amd64. Preparing to unpack .../05-libxcb1_1.14-3ubuntu3_amd64.deb ... Unpacking libxcb1:amd64 (1.14-3ubuntu3) ... Selecting previously unselected package libx11-data. Preparing to unpack .../06-libx11-data_2%3a1.7.5-1ubuntu0.3_all.deb ... Unpacking libx11-data (2:1.7.5-1ubuntu0.3) ... Selecting previously unselected package libx11-6:amd64. Preparing to unpack .../07-libx11-6_2%3a1.7.5-1ubuntu0.3_amd64.deb ... Unpacking libx11-6:amd64 (2:1.7.5-1ubuntu0.3) ... Selecting previously unselected package libxext6:amd64. Preparing to unpack .../08-libxext6_2%3a1.3.4-1build1_amd64.deb ... Unpacking libxext6:amd64 (2:1.3.4-1build1) ... Selecting previously unselected package libsigsegv2:amd64. Preparing to unpack .../09-libsigsegv2_2.13-1ubuntu3_amd64.deb ... Unpacking libsigsegv2:amd64 (2.13-1ubuntu3) ... Selecting previously unselected package m4. Preparing to unpack .../10-m4_1.4.18-5ubuntu2_amd64.deb ... Unpacking m4 (1.4.18-5ubuntu2) ... Selecting previously unselected package autoconf. Preparing to unpack .../11-autoconf_2.71-2_all.deb ... Unpacking autoconf (2.71-2) ... Selecting previously unselected package autotools-dev. Preparing to unpack .../12-autotools-dev_20220109.1_all.deb ... Unpacking autotools-dev (20220109.1) ... Selecting previously unselected package automake. Preparing to unpack .../13-automake_1%3a1.16.5-1.3_all.deb ... Unpacking automake (1:1.16.5-1.3) ... Selecting previously unselected package libgfortran5:amd64. Preparing to unpack .../14-libgfortran5_12.3.0-1ubuntu1~22.04_amd64.deb ... Unpacking libgfortran5:amd64 (12.3.0-1ubuntu1~22.04) ... Selecting previously unselected package libgfortran-11-dev:amd64. 
Preparing to unpack .../15-libgfortran-11-dev_11.4.0-1ubuntu1~22.04_amd64.deb ... Unpacking libgfortran-11-dev:amd64 (11.4.0-1ubuntu1~22.04) ... Selecting previously unselected package gfortran-11. Preparing to unpack .../16-gfortran-11_11.4.0-1ubuntu1~22.04_amd64.deb ... Unpacking gfortran-11 (11.4.0-1ubuntu1~22.04) ... Selecting previously unselected package gfortran. Preparing to unpack .../17-gfortran_4%3a11.2.0-1ubuntu1_amd64.deb ... Unpacking gfortran (4:11.2.0-1ubuntu1) ... Selecting previously unselected package javascript-common. Preparing to unpack .../18-javascript-common_11+nmu1_all.deb ... Unpacking javascript-common (11+nmu1) ... Selecting previously unselected package libevent-core-2.1-7:amd64. Preparing to unpack .../19-libevent-core-2.1-7_2.1.12-stable-1build3_amd64.deb ... Unpacking libevent-core-2.1-7:amd64 (2.1.12-stable-1build3) ... Selecting previously unselected package libevent-pthreads-2.1-7:amd64. Preparing to unpack .../20-libevent-pthreads-2.1-7_2.1.12-stable-1build3_amd64.deb ... Unpacking libevent-pthreads-2.1-7:amd64 (2.1.12-stable-1build3) ... Selecting previously unselected package libpsm-infinipath1. Preparing to unpack .../21-libpsm-infinipath1_3.3+20.604758e7-6.1_amd64.deb ... Unpacking libpsm-infinipath1 (3.3+20.604758e7-6.1) ... Selecting previously unselected package libpsm2-2. Preparing to unpack .../22-libpsm2-2_11.2.185-1_amd64.deb ... Unpacking libpsm2-2 (11.2.185-1) ... Selecting previously unselected package libfabric1:amd64. Preparing to unpack .../23-libfabric1_1.11.0-3_amd64.deb ... Unpacking libfabric1:amd64 (1.11.0-3) ... Selecting previously unselected package libhwloc15:amd64. Preparing to unpack .../24-libhwloc15_2.7.0-2ubuntu1_amd64.deb ... Unpacking libhwloc15:amd64 (2.7.0-2ubuntu1) ... Selecting previously unselected package libpciaccess0:amd64. Preparing to unpack .../25-libpciaccess0_0.16-3_amd64.deb ... Unpacking libpciaccess0:amd64 (0.16-3) ... Selecting previously unselected package libxnvctrl0:amd64. 
Preparing to unpack .../26-libxnvctrl0_550.54.14-0ubuntu1_amd64.deb ... Unpacking libxnvctrl0:amd64 (550.54.14-0ubuntu1) ... Selecting previously unselected package ocl-icd-libopencl1:amd64. Preparing to unpack .../27-ocl-icd-libopencl1_2.2.14-3_amd64.deb ... Unpacking ocl-icd-libopencl1:amd64 (2.2.14-3) ... Selecting previously unselected package libhwloc-plugins:amd64. Preparing to unpack .../28-libhwloc-plugins_2.7.0-2ubuntu1_amd64.deb ... Unpacking libhwloc-plugins:amd64 (2.7.0-2ubuntu1) ... Selecting previously unselected package libpmix2:amd64. Preparing to unpack .../29-libpmix2_4.1.2-2ubuntu1_amd64.deb ... Unpacking libpmix2:amd64 (4.1.2-2ubuntu1) ... Selecting previously unselected package libucx0:amd64. Preparing to unpack .../30-libucx0_1.12.1~rc2-1_amd64.deb ... Unpacking libucx0:amd64 (1.12.1~rc2-1) ... Selecting previously unselected package libopenmpi3:amd64. Preparing to unpack .../31-libopenmpi3_4.1.2-2ubuntu1_amd64.deb ... Unpacking libopenmpi3:amd64 (4.1.2-2ubuntu1) ... Selecting previously unselected package libcaf-openmpi-3:amd64. Preparing to unpack .../32-libcaf-openmpi-3_2.9.2-3_amd64.deb ... Unpacking libcaf-openmpi-3:amd64 (2.9.2-3) ... Selecting previously unselected package libcoarrays-dev:amd64. Preparing to unpack .../33-libcoarrays-dev_2.9.2-3_amd64.deb ... Unpacking libcoarrays-dev:amd64 (2.9.2-3) ... Selecting previously unselected package openmpi-common. Preparing to unpack .../34-openmpi-common_4.1.2-2ubuntu1_all.deb ... Unpacking openmpi-common (4.1.2-2ubuntu1) ... Selecting previously unselected package openmpi-bin. Preparing to unpack .../35-openmpi-bin_4.1.2-2ubuntu1_amd64.deb ... Unpacking openmpi-bin (4.1.2-2ubuntu1) ... Selecting previously unselected package libcoarrays-openmpi-dev:amd64. Preparing to unpack .../36-libcoarrays-openmpi-dev_2.9.2-3_amd64.deb ... Unpacking libcoarrays-openmpi-dev:amd64 (2.9.2-3) ... Selecting previously unselected package libevent-2.1-7:amd64. 
Preparing to unpack .../37-libevent-2.1-7_2.1.12-stable-1build3_amd64.deb ... Unpacking libevent-2.1-7:amd64 (2.1.12-stable-1build3) ... Selecting previously unselected package libevent-extra-2.1-7:amd64. Preparing to unpack .../38-libevent-extra-2.1-7_2.1.12-stable-1build3_amd64.deb ... Unpacking libevent-extra-2.1-7:amd64 (2.1.12-stable-1build3) ... Selecting previously unselected package libevent-openssl-2.1-7:amd64. Preparing to unpack .../39-libevent-openssl-2.1-7_2.1.12-stable-1build3_amd64.deb ... Unpacking libevent-openssl-2.1-7:amd64 (2.1.12-stable-1build3) ... Selecting previously unselected package libevent-dev. Preparing to unpack .../40-libevent-dev_2.1.12-stable-1build3_amd64.deb ... Unpacking libevent-dev (2.1.12-stable-1build3) ... Selecting previously unselected package libjs-jquery. Preparing to unpack .../41-libjs-jquery_3.6.0+dfsg+~3.5.13-1_all.deb ... Unpacking libjs-jquery (3.6.0+dfsg+~3.5.13-1) ... Selecting previously unselected package libjs-jquery-ui. Preparing to unpack .../42-libjs-jquery-ui_1.13.1+dfsg-1_all.deb ... Unpacking libjs-jquery-ui (1.13.1+dfsg-1) ... Selecting previously unselected package libltdl7:amd64. Preparing to unpack .../43-libltdl7_2.4.6-15build2_amd64.deb ... Unpacking libltdl7:amd64 (2.4.6-15build2) ... Selecting previously unselected package libltdl-dev:amd64. Preparing to unpack .../44-libltdl-dev_2.4.6-15build2_amd64.deb ... Unpacking libltdl-dev:amd64 (2.4.6-15build2) ... Selecting previously unselected package libhwloc-dev:amd64. Preparing to unpack .../45-libhwloc-dev_2.7.0-2ubuntu1_amd64.deb ... Unpacking libhwloc-dev:amd64 (2.7.0-2ubuntu1) ... Selecting previously unselected package libpmix-dev:amd64. Preparing to unpack .../46-libpmix-dev_4.1.2-2ubuntu1_amd64.deb ... Unpacking libpmix-dev:amd64 (4.1.2-2ubuntu1) ... Selecting previously unselected package libtool. Preparing to unpack .../47-libtool_2.4.6-15build2_all.deb ... Unpacking libtool (2.4.6-15build2) ... 
Selecting previously unselected package python-is-python3. Preparing to unpack .../48-python-is-python3_3.9.2-2_all.deb ... Unpacking python-is-python3 (3.9.2-2) ... Selecting previously unselected package libopenmpi-dev:amd64. Preparing to unpack .../49-libopenmpi-dev_4.1.2-2ubuntu1_amd64.deb ... Unpacking libopenmpi-dev:amd64 (4.1.2-2ubuntu1) ... Setting up javascript-common (11+nmu1) ... Setting up libpciaccess0:amd64 (0.16-3) ... Setting up libxau6:amd64 (1:1.0.9-1build5) ... Setting up libxdmcp6:amd64 (1:1.1.3-0ubuntu5) ... Setting up libucx0:amd64 (1.12.1~rc2-1) ... Setting up libxcb1:amd64 (1.14-3ubuntu3) ... Setting up libmagic-mgc (1:5.41-3ubuntu0.1) ... Setting up libmagic1:amd64 (1:5.41-3ubuntu0.1) ... Setting up file (1:5.41-3ubuntu0.1) ... Setting up autotools-dev (20220109.1) ... Setting up libx11-data (2:1.7.5-1ubuntu0.3) ... Setting up libsigsegv2:amd64 (2.13-1ubuntu3) ... Setting up libhwloc15:amd64 (2.7.0-2ubuntu1) ... Setting up libevent-core-2.1-7:amd64 (2.1.12-stable-1build3) ... Setting up libevent-2.1-7:amd64 (2.1.12-stable-1build3) ... Setting up libltdl7:amd64 (2.4.6-15build2) ... Setting up libgfortran5:amd64 (12.3.0-1ubuntu1~22.04) ... Setting up ocl-icd-libopencl1:amd64 (2.2.14-3) ... Setting up libpsm2-2 (11.2.185-1) ... Setting up openmpi-common (4.1.2-2ubuntu1) ... Setting up libx11-6:amd64 (2:1.7.5-1ubuntu0.3) ... Setting up libpsm-infinipath1 (3.3+20.604758e7-6.1) ... update-alternatives: using /usr/lib/libpsm1/libpsm_infinipath.so.1.16 to provide /usr/lib/x86_64-linux-gnu/libpsm_infinipath.so.1 (libpsm_infinipath.so.1) in auto mode Setting up libjs-jquery (3.6.0+dfsg+~3.5.13-1) ... Setting up python-is-python3 (3.9.2-2) ... Setting up libevent-pthreads-2.1-7:amd64 (2.1.12-stable-1build3) ... Setting up libfabric1:amd64 (1.11.0-3) ... Setting up libevent-extra-2.1-7:amd64 (2.1.12-stable-1build3) ... Setting up libtool (2.4.6-15build2) ... Setting up libgfortran-11-dev:amd64 (11.4.0-1ubuntu1~22.04) ... 
Setting up libxext6:amd64 (2:1.3.4-1build1) ... Setting up libevent-openssl-2.1-7:amd64 (2.1.12-stable-1build3) ... Setting up m4 (1.4.18-5ubuntu2) ... Setting up libxnvctrl0:amd64 (550.54.14-0ubuntu1) ... Setting up libjs-jquery-ui (1.13.1+dfsg-1) ... Setting up libevent-dev (2.1.12-stable-1build3) ... Setting up gfortran-11 (11.4.0-1ubuntu1~22.04) ... Setting up autoconf (2.71-2) ... Setting up automake (1:1.16.5-1.3) ... update-alternatives: using /usr/bin/automake-1.16 to provide /usr/bin/automake (automake) in auto mode update-alternatives: warning: skip creation of /usr/share/man/man1/automake.1.gz because associated file /usr/share/man/man1/automake-1.16.1.gz (of link group automake) doesn't exist update-alternatives: warning: skip creation of /usr/share/man/man1/aclocal.1.gz because associated file /usr/share/man/man1/aclocal-1.16.1.gz (of link group automake) doesn't exist Setting up libhwloc-plugins:amd64 (2.7.0-2ubuntu1) ... Setting up gfortran (4:11.2.0-1ubuntu1) ... update-alternatives: using /usr/bin/gfortran to provide /usr/bin/f95 (f95) in auto mode update-alternatives: warning: skip creation of /usr/share/man/man1/f95.1.gz because associated file /usr/share/man/man1/gfortran.1.gz (of link group f95) doesn't exist update-alternatives: using /usr/bin/gfortran to provide /usr/bin/f77 (f77) in auto mode update-alternatives: warning: skip creation of /usr/share/man/man1/f77.1.gz because associated file /usr/share/man/man1/gfortran.1.gz (of link group f77) doesn't exist Setting up libltdl-dev:amd64 (2.4.6-15build2) ... Setting up libhwloc-dev:amd64 (2.7.0-2ubuntu1) ... Setting up libpmix2:amd64 (4.1.2-2ubuntu1) ... Setting up libcoarrays-dev:amd64 (2.9.2-3) ... Setting up libopenmpi3:amd64 (4.1.2-2ubuntu1) ... Setting up libcaf-openmpi-3:amd64 (2.9.2-3) ... Setting up libpmix-dev:amd64 (4.1.2-2ubuntu1) ... Setting up openmpi-bin (4.1.2-2ubuntu1) ... 
update-alternatives: using /usr/bin/mpirun.openmpi to provide /usr/bin/mpirun (mpirun) in auto mode update-alternatives: warning: skip creation of /usr/share/man/man1/mpirun.1.gz because associated file /usr/share/man/man1/mpirun.openmpi.1.gz (of link group mpirun) doesn't exist update-alternatives: warning: skip creation of /usr/share/man/man1/mpiexec.1.gz because associated file /usr/share/man/man1/mpiexec.openmpi.1.gz (of link group mpirun) doesn't exist update-alternatives: using /usr/bin/mpicc.openmpi to provide /usr/bin/mpicc (mpi) in auto mode update-alternatives: warning: skip creation of /usr/share/man/man1/mpicc.1.gz because associated file /usr/share/man/man1/mpicc.openmpi.1.gz (of link group mpi) doesn't exist update-alternatives: warning: skip creation of /usr/share/man/man1/mpic++.1.gz because associated file /usr/share/man/man1/mpic++.openmpi.1.gz (of link group mpi) doesn't exist update-alternatives: warning: skip creation of /usr/share/man/man1/mpicxx.1.gz because associated file /usr/share/man/man1/mpicxx.openmpi.1.gz (of link group mpi) doesn't exist update-alternatives: warning: skip creation of /usr/share/man/man1/mpiCC.1.gz because associated file /usr/share/man/man1/mpiCC.openmpi.1.gz (of link group mpi) doesn't exist update-alternatives: warning: skip creation of /usr/share/man/man1/mpif77.1.gz because associated file /usr/share/man/man1/mpif77.openmpi.1.gz (of link group mpi) doesn't exist update-alternatives: warning: skip creation of /usr/share/man/man1/mpif90.1.gz because associated file /usr/share/man/man1/mpif90.openmpi.1.gz (of link group mpi) doesn't exist update-alternatives: warning: skip creation of /usr/share/man/man1/mpifort.1.gz because associated file /usr/share/man/man1/mpifort.openmpi.1.gz (of link group mpi) doesn't exist Setting up libcoarrays-openmpi-dev:amd64 (2.9.2-3) ... 
update-alternatives: using /usr/lib/x86_64-linux-gnu/open-coarrays/openmpi/bin/caf to provide /usr/bin/caf.openmpi (caf-openmpi) in auto mode
update-alternatives: using /usr/bin/caf.openmpi to provide /usr/bin/caf (caf) in auto mode
Setting up libopenmpi-dev:amd64 (4.1.2-2ubuntu1) ...
update-alternatives: using /usr/lib/x86_64-linux-gnu/openmpi/include to provide /usr/include/x86_64-linux-gnu/mpi (mpi-x86_64-linux-gnu) in auto mode
Processing triggers for ccache (4.5.1-1) ...
Updating symlinks in /usr/lib/ccache ...
Processing triggers for libc-bin (2.35-0ubuntu3.5) ...
root@6d80b0d9f83a:/opt/tritonserver# python --version
Python 3.10.12
root@6d80b0d9f83a:/opt/tritonserver# pip3 install tensorrt_llm -U --extra-index-url https://pypi.nvidia.com
Looking in indexes: https://pypi.org/simple, https://pypi.nvidia.com
Collecting tensorrt_llm
  Downloading https://pypi.nvidia.com/tensorrt-llm/tensorrt_llm-0.8.0-cp310-cp310-linux_x86_64.whl (1126.4 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.1/1.1 GB 8.2 MB/s eta 0:00:00
Collecting accelerate==0.25.0 (from tensorrt_llm)
  Downloading accelerate-0.25.0-py3-none-any.whl.metadata (18 kB)
Collecting build (from tensorrt_llm)
  Downloading build-1.1.1-py3-none-any.whl.metadata (4.2 kB)
Collecting colored (from tensorrt_llm)
  Downloading colored-2.2.4-py3-none-any.whl.metadata (3.6 kB)
Collecting cuda-python (from tensorrt_llm)
  Downloading cuda_python-12.4.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (12 kB)
Collecting diffusers==0.15.0 (from tensorrt_llm)
  Downloading diffusers-0.15.0-py3-none-any.whl.metadata (19 kB)
Collecting lark (from tensorrt_llm)
  Downloading lark-1.1.9-py3-none-any.whl.metadata (1.9 kB)
Collecting mpi4py (from tensorrt_llm)
  Downloading mpi4py-3.1.5.tar.gz (2.5 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.5/2.5 MB 30.6 MB/s eta 0:00:00
  Installing build dependencies ... done
  Getting requirements to build wheel ... done
  Preparing metadata (pyproject.toml) ...
done Requirement already satisfied: numpy in /usr/local/lib/python3.10/dist-packages (from tensorrt_llm) (1.26.4) Collecting onnx>=1.12.0 (from tensorrt_llm) Downloading onnx-1.15.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (15 kB) Collecting polygraphy (from tensorrt_llm) Downloading https://pypi.nvidia.com/polygraphy/polygraphy-0.49.0-py2.py3-none-any.whl (327 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 327.9/327.9 kB 67.1 MB/s eta 0:00:00 Collecting psutil (from tensorrt_llm) Downloading psutil-5.9.8-cp36-abi3-manylinux_2_12_x86_64.manylinux2010_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (21 kB) Collecting pynvml>=11.5.0 (from tensorrt_llm) Downloading pynvml-11.5.0-py3-none-any.whl.metadata (7.8 kB) Collecting sentencepiece>=0.1.99 (from tensorrt_llm) Downloading sentencepiece-0.2.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (7.7 kB) Requirement already satisfied: tensorrt==9.2.0.post12.dev5 in /usr/local/lib/python3.10/dist-packages (from tensorrt_llm) (9.2.0.post12.dev5) Collecting torch<=2.2.0a (from tensorrt_llm) Downloading torch-2.1.2-cp310-cp310-manylinux1_x86_64.whl.metadata (25 kB) Collecting transformers==4.36.1 (from tensorrt_llm) Downloading transformers-4.36.1-py3-none-any.whl.metadata (126 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 126.8/126.8 kB 36.8 MB/s eta 0:00:00 Requirement already satisfied: wheel in /usr/local/lib/python3.10/dist-packages (from tensorrt_llm) (0.42.0) Collecting optimum (from tensorrt_llm) Downloading optimum-1.17.1-py3-none-any.whl.metadata (18 kB) Collecting evaluate (from tensorrt_llm) Downloading evaluate-0.4.1-py3-none-any.whl.metadata (9.4 kB) Collecting janus (from tensorrt_llm) Downloading janus-1.0.0-py3-none-any.whl.metadata (4.5 kB) Collecting nvidia-ammo~=0.7.0 (from tensorrt_llm) Downloading https://pypi.nvidia.com/nvidia-ammo/nvidia_ammo-0.7.4-cp310-cp310-linux_x86_64.whl (975 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 975.6/975.6 kB 
105.0 MB/s eta 0:00:00 Requirement already satisfied: packaging>=20.0 in /usr/local/lib/python3.10/dist-packages (from accelerate==0.25.0->tensorrt_llm) (23.2) Requirement already satisfied: pyyaml in /usr/local/lib/python3.10/dist-packages (from accelerate==0.25.0->tensorrt_llm) (6.0.1) Requirement already satisfied: huggingface-hub in /usr/local/lib/python3.10/dist-packages (from accelerate==0.25.0->tensorrt_llm) (0.20.3) Requirement already satisfied: safetensors>=0.3.1 in /usr/local/lib/python3.10/dist-packages (from accelerate==0.25.0->tensorrt_llm) (0.4.2) Collecting Pillow (from diffusers==0.15.0->tensorrt_llm) Downloading pillow-10.2.0-cp310-cp310-manylinux_2_28_x86_64.whl.metadata (9.7 kB) Requirement already satisfied: filelock in /usr/local/lib/python3.10/dist-packages (from diffusers==0.15.0->tensorrt_llm) (3.13.1) Requirement already satisfied: importlib-metadata in /usr/lib/python3/dist-packages (from diffusers==0.15.0->tensorrt_llm) (4.6.4) Requirement already satisfied: regex!=2019.12.17 in /usr/local/lib/python3.10/dist-packages (from diffusers==0.15.0->tensorrt_llm) (2023.12.25) Requirement already satisfied: requests in /usr/local/lib/python3.10/dist-packages (from diffusers==0.15.0->tensorrt_llm) (2.31.0) Requirement already satisfied: tokenizers<0.19,>=0.14 in /usr/local/lib/python3.10/dist-packages (from transformers==4.36.1->tensorrt_llm) (0.15.2) Requirement already satisfied: tqdm>=4.27 in /usr/local/lib/python3.10/dist-packages (from transformers==4.36.1->tensorrt_llm) (4.66.2) Collecting ninja (from nvidia-ammo~=0.7.0->tensorrt_llm) Downloading ninja-1.11.1.1-py2.py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.whl.metadata (5.3 kB) Collecting networkx (from nvidia-ammo~=0.7.0->tensorrt_llm) Downloading networkx-3.2.1-py3-none-any.whl.metadata (5.2 kB) Collecting onnxruntime~=1.16.1 (from nvidia-ammo~=0.7.0->tensorrt_llm) Downloading onnxruntime-1.16.3-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (4.3 kB) Collecting 
onnx-graphsurgeon (from nvidia-ammo~=0.7.0->tensorrt_llm) Downloading https://pypi.nvidia.com/onnx-graphsurgeon/onnx_graphsurgeon-0.3.25-py2.py3-none-any.whl (40 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 40.9/40.9 kB 16.4 MB/s eta 0:00:00 Collecting scipy (from nvidia-ammo~=0.7.0->tensorrt_llm) Downloading scipy-1.12.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (60 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 60.4/60.4 kB 19.4 MB/s eta 0:00:00 Collecting protobuf>=3.20.2 (from onnx>=1.12.0->tensorrt_llm) Downloading protobuf-4.25.3-cp37-abi3-manylinux2014_x86_64.whl.metadata (541 bytes) Requirement already satisfied: typing-extensions in /usr/local/lib/python3.10/dist-packages (from torch<=2.2.0a->tensorrt_llm) (4.9.0) Collecting sympy (from torch<=2.2.0a->tensorrt_llm) Downloading sympy-1.12-py3-none-any.whl.metadata (12 kB) Collecting jinja2 (from torch<=2.2.0a->tensorrt_llm) Downloading Jinja2-3.1.3-py3-none-any.whl.metadata (3.3 kB) Requirement already satisfied: fsspec in /usr/local/lib/python3.10/dist-packages (from torch<=2.2.0a->tensorrt_llm) (2024.2.0) Collecting nvidia-cuda-nvrtc-cu12==12.1.105 (from torch<=2.2.0a->tensorrt_llm) Downloading https://pypi.nvidia.com/nvidia-cuda-nvrtc-cu12/nvidia_cuda_nvrtc_cu12-12.1.105-py3-none-manylinux1_x86_64.whl (23.7 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 23.7/23.7 MB 124.4 MB/s eta 0:00:00 Collecting nvidia-cuda-runtime-cu12==12.1.105 (from torch<=2.2.0a->tensorrt_llm) Downloading https://pypi.nvidia.com/nvidia-cuda-runtime-cu12/nvidia_cuda_runtime_cu12-12.1.105-py3-none-manylinux1_x86_64.whl (823 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 823.6/823.6 kB 95.5 MB/s eta 0:00:00 Collecting nvidia-cuda-cupti-cu12==12.1.105 (from torch<=2.2.0a->tensorrt_llm) Downloading https://pypi.nvidia.com/nvidia-cuda-cupti-cu12/nvidia_cuda_cupti_cu12-12.1.105-py3-none-manylinux1_x86_64.whl (14.1 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 14.1/14.1 MB 142.6 MB/s eta 0:00:00 Collecting 
nvidia-cudnn-cu12==8.9.2.26 (from torch<=2.2.0a->tensorrt_llm) Downloading https://pypi.nvidia.com/nvidia-cudnn-cu12/nvidia_cudnn_cu12-8.9.2.26-py3-none-manylinux1_x86_64.whl (731.7 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 731.7/731.7 MB 13.1 MB/s eta 0:00:00 Collecting nvidia-cublas-cu12==12.1.3.1 (from torch<=2.2.0a->tensorrt_llm) Downloading https://pypi.nvidia.com/nvidia-cublas-cu12/nvidia_cublas_cu12-12.1.3.1-py3-none-manylinux1_x86_64.whl (410.6 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 410.6/410.6 MB 20.8 MB/s eta 0:00:00 Collecting nvidia-cufft-cu12==11.0.2.54 (from torch<=2.2.0a->tensorrt_llm) Downloading https://pypi.nvidia.com/nvidia-cufft-cu12/nvidia_cufft_cu12-11.0.2.54-py3-none-manylinux1_x86_64.whl (121.6 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 121.6/121.6 MB 52.3 MB/s eta 0:00:00 Collecting nvidia-curand-cu12==10.3.2.106 (from torch<=2.2.0a->tensorrt_llm) Downloading https://pypi.nvidia.com/nvidia-curand-cu12/nvidia_curand_cu12-10.3.2.106-py3-none-manylinux1_x86_64.whl (56.5 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 56.5/56.5 MB 83.8 MB/s eta 0:00:00 Collecting nvidia-cusolver-cu12==11.4.5.107 (from torch<=2.2.0a->tensorrt_llm) Downloading https://pypi.nvidia.com/nvidia-cusolver-cu12/nvidia_cusolver_cu12-11.4.5.107-py3-none-manylinux1_x86_64.whl (124.2 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 124.2/124.2 MB 58.3 MB/s eta 0:00:00 Collecting nvidia-cusparse-cu12==12.1.0.106 (from torch<=2.2.0a->tensorrt_llm) Downloading https://pypi.nvidia.com/nvidia-cusparse-cu12/nvidia_cusparse_cu12-12.1.0.106-py3-none-manylinux1_x86_64.whl (196.0 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 196.0/196.0 MB 43.9 MB/s eta 0:00:00 Collecting nvidia-nccl-cu12==2.18.1 (from torch<=2.2.0a->tensorrt_llm) Downloading https://pypi.nvidia.com/nvidia-nccl-cu12/nvidia_nccl_cu12-2.18.1-py3-none-manylinux1_x86_64.whl (209.7 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 209.7/209.7 MB 41.7 MB/s eta 0:00:00 Collecting nvidia-nvtx-cu12==12.1.105 (from 
torch<=2.2.0a->tensorrt_llm) Downloading https://pypi.nvidia.com/nvidia-nvtx-cu12/nvidia_nvtx_cu12-12.1.105-py3-none-manylinux1_x86_64.whl (99 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 99.1/99.1 kB 33.4 MB/s eta 0:00:00 Collecting triton==2.1.0 (from torch<=2.2.0a->tensorrt_llm) Downloading triton-2.1.0-0-cp310-cp310-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (1.3 kB) Collecting nvidia-nvjitlink-cu12 (from nvidia-cusolver-cu12==11.4.5.107->torch<=2.2.0a->tensorrt_llm) Downloading https://pypi.nvidia.com/nvidia-nvjitlink-cu12/nvidia_nvjitlink_cu12-12.4.99-py3-none-manylinux2014_x86_64.whl (21.1 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 21.1/21.1 MB 122.5 MB/s eta 0:00:00 Collecting pyproject_hooks (from build->tensorrt_llm) Downloading pyproject_hooks-1.0.0-py3-none-any.whl.metadata (1.3 kB) Collecting tomli>=1.1.0 (from build->tensorrt_llm) Downloading tomli-2.0.1-py3-none-any.whl.metadata (8.9 kB) Collecting datasets>=2.0.0 (from evaluate->tensorrt_llm) Downloading datasets-2.18.0-py3-none-any.whl.metadata (20 kB) Collecting dill (from evaluate->tensorrt_llm) Downloading dill-0.3.8-py3-none-any.whl.metadata (10 kB) Collecting pandas (from evaluate->tensorrt_llm) Downloading pandas-2.2.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (19 kB) Collecting xxhash (from evaluate->tensorrt_llm) Downloading xxhash-3.4.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (12 kB) Collecting multiprocess (from evaluate->tensorrt_llm) Downloading multiprocess-0.70.16-py310-none-any.whl.metadata (7.2 kB) Collecting responses<0.19 (from evaluate->tensorrt_llm) Downloading responses-0.18.0-py3-none-any.whl.metadata (29 kB) Collecting coloredlogs (from optimum->tensorrt_llm) Downloading coloredlogs-15.0.1-py2.py3-none-any.whl.metadata (12 kB) Collecting pyarrow>=12.0.0 (from datasets>=2.0.0->evaluate->tensorrt_llm) Downloading pyarrow-15.0.0-cp310-cp310-manylinux_2_28_x86_64.whl.metadata (3.0 kB) Collecting pyarrow-hotfix 
(from datasets>=2.0.0->evaluate->tensorrt_llm) Downloading pyarrow_hotfix-0.6-py3-none-any.whl.metadata (3.6 kB) Collecting aiohttp (from datasets>=2.0.0->evaluate->tensorrt_llm) Downloading aiohttp-3.9.3-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (7.4 kB) Collecting flatbuffers (from onnxruntime~=1.16.1->nvidia-ammo~=0.7.0->tensorrt_llm) Downloading flatbuffers-23.5.26-py2.py3-none-any.whl.metadata (850 bytes) Requirement already satisfied: charset-normalizer<4,>=2 in /usr/local/lib/python3.10/dist-packages (from requests->diffusers==0.15.0->tensorrt_llm) (3.3.2) Requirement already satisfied: idna<4,>=2.5 in /usr/local/lib/python3.10/dist-packages (from requests->diffusers==0.15.0->tensorrt_llm) (3.6) Requirement already satisfied: urllib3<3,>=1.21.1 in /usr/local/lib/python3.10/dist-packages (from requests->diffusers==0.15.0->tensorrt_llm) (2.2.1) Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.10/dist-packages (from requests->diffusers==0.15.0->tensorrt_llm) (2024.2.2) Collecting humanfriendly>=9.1 (from coloredlogs->optimum->tensorrt_llm) Downloading humanfriendly-10.0-py2.py3-none-any.whl.metadata (9.2 kB) Collecting MarkupSafe>=2.0 (from jinja2->torch<=2.2.0a->tensorrt_llm) Downloading MarkupSafe-2.1.5-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (3.0 kB) Collecting python-dateutil>=2.8.2 (from pandas->evaluate->tensorrt_llm) Downloading python_dateutil-2.9.0.post0-py2.py3-none-any.whl.metadata (8.4 kB) Collecting pytz>=2020.1 (from pandas->evaluate->tensorrt_llm) Downloading pytz-2024.1-py2.py3-none-any.whl.metadata (22 kB) Collecting tzdata>=2022.7 (from pandas->evaluate->tensorrt_llm) Downloading tzdata-2024.1-py2.py3-none-any.whl.metadata (1.4 kB) Collecting mpmath>=0.19 (from sympy->torch<=2.2.0a->tensorrt_llm) Downloading mpmath-1.3.0-py3-none-any.whl.metadata (8.6 kB) Collecting aiosignal>=1.1.2 (from aiohttp->datasets>=2.0.0->evaluate->tensorrt_llm) Downloading 
aiosignal-1.3.1-py3-none-any.whl.metadata (4.0 kB) Collecting attrs>=17.3.0 (from aiohttp->datasets>=2.0.0->evaluate->tensorrt_llm) Downloading attrs-23.2.0-py3-none-any.whl.metadata (9.5 kB) Collecting frozenlist>=1.1.1 (from aiohttp->datasets>=2.0.0->evaluate->tensorrt_llm) Downloading frozenlist-1.4.1-cp310-cp310-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (12 kB) Collecting multidict<7.0,>=4.5 (from aiohttp->datasets>=2.0.0->evaluate->tensorrt_llm) Downloading multidict-6.0.5-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (4.2 kB) Collecting yarl<2.0,>=1.0 (from aiohttp->datasets>=2.0.0->evaluate->tensorrt_llm) Downloading yarl-1.9.4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (31 kB) Collecting async-timeout<5.0,>=4.0 (from aiohttp->datasets>=2.0.0->evaluate->tensorrt_llm) Downloading async_timeout-4.0.3-py3-none-any.whl.metadata (4.2 kB) Requirement already satisfied: six>=1.5 in /usr/lib/python3/dist-packages (from python-dateutil>=2.8.2->pandas->evaluate->tensorrt_llm) (1.16.0) Downloading accelerate-0.25.0-py3-none-any.whl (265 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 265.7/265.7 kB 41.0 MB/s eta 0:00:00 Downloading diffusers-0.15.0-py3-none-any.whl (851 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 851.8/851.8 kB 53.9 MB/s eta 0:00:00 Downloading transformers-4.36.1-py3-none-any.whl (8.3 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 8.3/8.3 MB 93.4 MB/s eta 0:00:00 Downloading onnx-1.15.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (15.7 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 15.7/15.7 MB 141.5 MB/s eta 0:00:00 Downloading pynvml-11.5.0-py3-none-any.whl (53 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 53.1/53.1 kB 17.8 MB/s eta 0:00:00 Downloading sentencepiece-0.2.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.3 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.3/1.3 MB 117.5 MB/s eta 0:00:00 Downloading 
torch-2.1.2-cp310-cp310-manylinux1_x86_64.whl (670.2 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 670.2/670.2 MB 16.1 MB/s eta 0:00:00 Downloading triton-2.1.0-0-cp310-cp310-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (89.2 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 89.2/89.2 MB 68.1 MB/s eta 0:00:00 Downloading build-1.1.1-py3-none-any.whl (19 kB) Downloading colored-2.2.4-py3-none-any.whl (16 kB) Downloading cuda_python-12.4.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (24.5 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 24.5/24.5 MB 119.0 MB/s eta 0:00:00 Downloading evaluate-0.4.1-py3-none-any.whl (84 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 84.1/84.1 kB 27.2 MB/s eta 0:00:00 Downloading janus-1.0.0-py3-none-any.whl (6.9 kB) Downloading lark-1.1.9-py3-none-any.whl (111 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 111.7/111.7 kB 27.5 MB/s eta 0:00:00 Downloading optimum-1.17.1-py3-none-any.whl (407 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 407.1/407.1 kB 81.9 MB/s eta 0:00:00 Downloading psutil-5.9.8-cp36-abi3-manylinux_2_12_x86_64.manylinux2010_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (288 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 288.2/288.2 kB 83.4 MB/s eta 0:00:00 Downloading datasets-2.18.0-py3-none-any.whl (510 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 510.5/510.5 kB 89.1 MB/s eta 0:00:00 Downloading dill-0.3.8-py3-none-any.whl (116 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 116.3/116.3 kB 51.8 MB/s eta 0:00:00 Downloading onnxruntime-1.16.3-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (6.4 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.4/6.4 MB 159.0 MB/s eta 0:00:00 Downloading protobuf-4.25.3-cp37-abi3-manylinux2014_x86_64.whl (294 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 294.6/294.6 kB 67.1 MB/s eta 0:00:00 Downloading responses-0.18.0-py3-none-any.whl (38 kB) Downloading tomli-2.0.1-py3-none-any.whl (12 kB) Downloading coloredlogs-15.0.1-py2.py3-none-any.whl (46 kB) 
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 46.0/46.0 kB 13.2 MB/s eta 0:00:00 Downloading Jinja2-3.1.3-py3-none-any.whl (133 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 133.2/133.2 kB 56.7 MB/s eta 0:00:00 Downloading multiprocess-0.70.16-py310-none-any.whl (134 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 134.8/134.8 kB 41.3 MB/s eta 0:00:00 Downloading networkx-3.2.1-py3-none-any.whl (1.6 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.6/1.6 MB 112.7 MB/s eta 0:00:00 Downloading ninja-1.11.1.1-py2.py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.whl (307 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 307.2/307.2 kB 62.5 MB/s eta 0:00:00 Downloading pandas-2.2.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (13.0 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 13.0/13.0 MB 146.5 MB/s eta 0:00:00 Downloading pillow-10.2.0-cp310-cp310-manylinux_2_28_x86_64.whl (4.5 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 4.5/4.5 MB 153.5 MB/s eta 0:00:00 Downloading pyproject_hooks-1.0.0-py3-none-any.whl (9.3 kB) Downloading scipy-1.12.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (38.4 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 38.4/38.4 MB 122.1 MB/s eta 0:00:00 Downloading sympy-1.12-py3-none-any.whl (5.7 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 5.7/5.7 MB 173.4 MB/s eta 0:00:00 Downloading xxhash-3.4.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (194 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 194.1/194.1 kB 52.8 MB/s eta 0:00:00 Downloading aiohttp-3.9.3-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.2 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.2/1.2 MB 119.4 MB/s eta 0:00:00 Downloading humanfriendly-10.0-py2.py3-none-any.whl (86 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 86.8/86.8 kB 34.7 MB/s eta 0:00:00 Downloading MarkupSafe-2.1.5-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (25 kB) Downloading mpmath-1.3.0-py3-none-any.whl (536 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 536.2/536.2 kB 89.0 
MB/s eta 0:00:00 Downloading pyarrow-15.0.0-cp310-cp310-manylinux_2_28_x86_64.whl (38.3 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 38.3/38.3 MB 119.1 MB/s eta 0:00:00 Downloading python_dateutil-2.9.0.post0-py2.py3-none-any.whl (229 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 229.9/229.9 kB 62.5 MB/s eta 0:00:00 Downloading pytz-2024.1-py2.py3-none-any.whl (505 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 505.5/505.5 kB 87.7 MB/s eta 0:00:00 Downloading tzdata-2024.1-py2.py3-none-any.whl (345 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 345.4/345.4 kB 78.2 MB/s eta 0:00:00 Downloading flatbuffers-23.5.26-py2.py3-none-any.whl (26 kB) Downloading pyarrow_hotfix-0.6-py3-none-any.whl (7.9 kB) Downloading aiosignal-1.3.1-py3-none-any.whl (7.6 kB) Downloading async_timeout-4.0.3-py3-none-any.whl (5.7 kB) Downloading attrs-23.2.0-py3-none-any.whl (60 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 60.8/60.8 kB 21.7 MB/s eta 0:00:00 Downloading frozenlist-1.4.1-cp310-cp310-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (239 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 239.5/239.5 kB 60.5 MB/s eta 0:00:00 Downloading multidict-6.0.5-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (124 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 124.3/124.3 kB 52.8 MB/s eta 0:00:00 Downloading yarl-1.9.4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (301 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 301.6/301.6 kB 65.9 MB/s eta 0:00:00 Building wheels for collected packages: mpi4py Building wheel for mpi4py (pyproject.toml) ... 
done
Created wheel for mpi4py: filename=mpi4py-3.1.5-cp310-cp310-linux_x86_64.whl size=2746504 sha256=eb84a77bf7ad87aaac29c0157ee81b4ea407e499e4ef8b516bb8800c1e2afbb9
Stored in directory: /root/.cache/pip/wheels/18/2b/7f/c852523089e9182b45fca50ff56f49a51eeb6284fd25a66713
Successfully built mpi4py
Installing collected packages: sentencepiece, pytz, ninja, mpmath, flatbuffers, cuda-python, xxhash, tzdata, triton, tomli, sympy, scipy, python-dateutil, pynvml, pyarrow-hotfix, pyarrow, psutil, protobuf, polygraphy, Pillow, nvidia-nvtx-cu12, nvidia-nvjitlink-cu12, nvidia-nccl-cu12, nvidia-curand-cu12, nvidia-cufft-cu12, nvidia-cuda-runtime-cu12, nvidia-cuda-nvrtc-cu12, nvidia-cuda-cupti-cu12, nvidia-cublas-cu12, networkx, multidict, mpi4py, MarkupSafe, lark, janus, humanfriendly, frozenlist, dill, colored, attrs, async-timeout, yarl, responses, pyproject_hooks, pandas, onnx, nvidia-cusparse-cu12, nvidia-cudnn-cu12, multiprocess, jinja2, coloredlogs, aiosignal, onnxruntime, onnx-graphsurgeon, nvidia-cusolver-cu12, diffusers, build, aiohttp, transformers, torch, nvidia-ammo, datasets, accelerate, optimum, evaluate, tensorrt_llm
Attempting uninstall: transformers
Found existing installation: transformers 4.38.1
Uninstalling transformers-4.38.1:
Successfully uninstalled transformers-4.38.1
Successfully installed MarkupSafe-2.1.5 Pillow-10.2.0 accelerate-0.25.0 aiohttp-3.9.3 aiosignal-1.3.1 async-timeout-4.0.3 attrs-23.2.0 build-1.1.1 colored-2.2.4 coloredlogs-15.0.1 cuda-python-12.4.0 datasets-2.18.0 diffusers-0.15.0 dill-0.3.8 evaluate-0.4.1 flatbuffers-23.5.26 frozenlist-1.4.1 humanfriendly-10.0 janus-1.0.0 jinja2-3.1.3 lark-1.1.9 mpi4py-3.1.5 mpmath-1.3.0 multidict-6.0.5 multiprocess-0.70.16 networkx-3.2.1 ninja-1.11.1.1 nvidia-ammo-0.7.4 nvidia-cublas-cu12-12.1.3.1 nvidia-cuda-cupti-cu12-12.1.105 nvidia-cuda-nvrtc-cu12-12.1.105 nvidia-cuda-runtime-cu12-12.1.105 nvidia-cudnn-cu12-8.9.2.26 nvidia-cufft-cu12-11.0.2.54 nvidia-curand-cu12-10.3.2.106 nvidia-cusolver-cu12-11.4.5.107 nvidia-cusparse-cu12-12.1.0.106 nvidia-nccl-cu12-2.18.1 nvidia-nvjitlink-cu12-12.4.99 nvidia-nvtx-cu12-12.1.105 onnx-1.15.0 onnx-graphsurgeon-0.3.25 onnxruntime-1.16.3 optimum-1.17.1 pandas-2.2.1 polygraphy-0.49.0 protobuf-4.25.3 psutil-5.9.8 pyarrow-15.0.0 pyarrow-hotfix-0.6 pynvml-11.5.0 pyproject_hooks-1.0.0 python-dateutil-2.9.0.post0 pytz-2024.1 responses-0.18.0 scipy-1.12.0 sentencepiece-0.2.0 sympy-1.12 tensorrt_llm-0.8.0 tomli-2.0.1 torch-2.1.2 transformers-4.36.1 triton-2.1.0 tzdata-2024.1 xxhash-3.4.1 yarl-1.9.4
WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv
```
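
The log above shows pip uninstalling transformers 4.38.1 in favour of 4.36.1 and resolving torch 2.1.2 while installing tensorrt_llm 0.8.0. A minimal sketch for confirming what actually ended up in the environment, assuming the versions from the "Successfully installed" line are the ones of interest. `EXPECTED`, `installed_version`, and `find_mismatches` are names introduced here for illustration; they are not part of TensorRT-LLM.

```python
from importlib import metadata

# Versions reported by "Successfully installed" in the log above.
EXPECTED = {
    "tensorrt_llm": "0.8.0",
    "torch": "2.1.2",
    "transformers": "4.36.1",
    "nvidia-ammo": "0.7.4",
    "mpi4py": "3.1.5",
}


def installed_version(name):
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(expected, lookup=installed_version):
    """Map each package whose version differs to (expected, found)."""
    mismatches = {}
    for name, want in expected.items():
        found = lookup(name)
        if found != want:
            mismatches[name] = (want, found)
    return mismatches


if __name__ == "__main__":
    for name, (want, found) in find_mismatches(EXPECTED).items():
        print(f"{name}: expected {want}, found {found}")
```

An empty result means the environment matches the log; a `None` in the "found" position means the package is not installed at all.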
Installing ammo

```bash
root@6d80b0d9f83a:/opt/tritonserver# pip download --extra-index-url https://pypi.nvidia.com nvidia-ammo
Looking in indexes: https://pypi.org/simple, https://pypi.nvidia.com
Collecting nvidia-ammo
Using cached https://pypi.nvidia.com/nvidia-ammo/nvidia_ammo-0.7.4-cp310-cp310-linux_x86_64.whl (975 kB)
Collecting ninja (from nvidia-ammo)
Using cached ninja-1.11.1.1-py2.py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.whl.metadata (5.3 kB)
Collecting networkx (from nvidia-ammo)
Using cached networkx-3.2.1-py3-none-any.whl.metadata (5.2 kB)
Collecting numpy (from nvidia-ammo)
Downloading numpy-1.26.4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (61 kB)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 61.0/61.0 kB 4.0 MB/s eta 0:00:00
Collecting onnx>=1.14.0 (from nvidia-ammo)
Using cached onnx-1.15.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (15 kB)
Collecting onnxruntime~=1.16.1 (from nvidia-ammo)
Using cached onnxruntime-1.16.3-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (4.3 kB)
Collecting onnx-graphsurgeon (from nvidia-ammo)
Using cached https://pypi.nvidia.com/onnx-graphsurgeon/onnx_graphsurgeon-0.3.25-py2.py3-none-any.whl (40 kB)
Collecting scipy (from nvidia-ammo)
Using cached scipy-1.12.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (60 kB)
Collecting torch>=1.11 (from nvidia-ammo)
Downloading torch-2.2.1-cp310-cp310-manylinux1_x86_64.whl.metadata (26 kB)
Collecting tqdm (from nvidia-ammo)
Downloading tqdm-4.66.2-py3-none-any.whl.metadata (57 kB)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 57.6/57.6 kB 8.2 MB/s eta 0:00:00
Collecting transformers (from nvidia-ammo)
Downloading transformers-4.38.2-py3-none-any.whl.metadata (130 kB)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 130.7/130.7 kB 15.1 MB/s eta 0:00:00
Collecting protobuf>=3.20.2 (from onnx>=1.14.0->nvidia-ammo)
Using cached protobuf-4.25.3-cp37-abi3-manylinux2014_x86_64.whl.metadata (541 bytes)
Collecting coloredlogs (from onnxruntime~=1.16.1->nvidia-ammo) Using cached coloredlogs-15.0.1-py2.py3-none-any.whl.metadata (12 kB) Collecting flatbuffers (from onnxruntime~=1.16.1->nvidia-ammo) Using cached flatbuffers-23.5.26-py2.py3-none-any.whl.metadata (850 bytes) Collecting packaging (from onnxruntime~=1.16.1->nvidia-ammo) Downloading packaging-23.2-py3-none-any.whl.metadata (3.2 kB) Collecting sympy (from onnxruntime~=1.16.1->nvidia-ammo) Using cached sympy-1.12-py3-none-any.whl.metadata (12 kB) Collecting filelock (from torch>=1.11->nvidia-ammo) Downloading filelock-3.13.1-py3-none-any.whl.metadata (2.8 kB) Collecting typing-extensions>=4.8.0 (from torch>=1.11->nvidia-ammo) Downloading typing_extensions-4.10.0-py3-none-any.whl.metadata (3.0 kB) Collecting jinja2 (from torch>=1.11->nvidia-ammo) Using cached Jinja2-3.1.3-py3-none-any.whl.metadata (3.3 kB) Collecting fsspec (from torch>=1.11->nvidia-ammo) Downloading fsspec-2024.2.0-py3-none-any.whl.metadata (6.8 kB) Collecting nvidia-cuda-nvrtc-cu12==12.1.105 (from torch>=1.11->nvidia-ammo) Using cached https://pypi.nvidia.com/nvidia-cuda-nvrtc-cu12/nvidia_cuda_nvrtc_cu12-12.1.105-py3-none-manylinux1_x86_64.whl (23.7 MB) Collecting nvidia-cuda-runtime-cu12==12.1.105 (from torch>=1.11->nvidia-ammo) Using cached https://pypi.nvidia.com/nvidia-cuda-runtime-cu12/nvidia_cuda_runtime_cu12-12.1.105-py3-none-manylinux1_x86_64.whl (823 kB) Collecting nvidia-cuda-cupti-cu12==12.1.105 (from torch>=1.11->nvidia-ammo) Using cached https://pypi.nvidia.com/nvidia-cuda-cupti-cu12/nvidia_cuda_cupti_cu12-12.1.105-py3-none-manylinux1_x86_64.whl (14.1 MB) Collecting nvidia-cudnn-cu12==8.9.2.26 (from torch>=1.11->nvidia-ammo) Using cached https://pypi.nvidia.com/nvidia-cudnn-cu12/nvidia_cudnn_cu12-8.9.2.26-py3-none-manylinux1_x86_64.whl (731.7 MB) Collecting nvidia-cublas-cu12==12.1.3.1 (from torch>=1.11->nvidia-ammo) Using cached 
https://pypi.nvidia.com/nvidia-cublas-cu12/nvidia_cublas_cu12-12.1.3.1-py3-none-manylinux1_x86_64.whl (410.6 MB) Collecting nvidia-cufft-cu12==11.0.2.54 (from torch>=1.11->nvidia-ammo) Using cached https://pypi.nvidia.com/nvidia-cufft-cu12/nvidia_cufft_cu12-11.0.2.54-py3-none-manylinux1_x86_64.whl (121.6 MB) Collecting nvidia-curand-cu12==10.3.2.106 (from torch>=1.11->nvidia-ammo) Using cached https://pypi.nvidia.com/nvidia-curand-cu12/nvidia_curand_cu12-10.3.2.106-py3-none-manylinux1_x86_64.whl (56.5 MB) Collecting nvidia-cusolver-cu12==11.4.5.107 (from torch>=1.11->nvidia-ammo) Using cached https://pypi.nvidia.com/nvidia-cusolver-cu12/nvidia_cusolver_cu12-11.4.5.107-py3-none-manylinux1_x86_64.whl (124.2 MB) Collecting nvidia-cusparse-cu12==12.1.0.106 (from torch>=1.11->nvidia-ammo) Using cached https://pypi.nvidia.com/nvidia-cusparse-cu12/nvidia_cusparse_cu12-12.1.0.106-py3-none-manylinux1_x86_64.whl (196.0 MB) Collecting nvidia-nccl-cu12==2.19.3 (from torch>=1.11->nvidia-ammo) Downloading https://pypi.nvidia.com/nvidia-nccl-cu12/nvidia_nccl_cu12-2.19.3-py3-none-manylinux1_x86_64.whl (158.7 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 158.7/158.7 MB 48.7 MB/s eta 0:00:00 Collecting nvidia-nvtx-cu12==12.1.105 (from torch>=1.11->nvidia-ammo) Using cached https://pypi.nvidia.com/nvidia-nvtx-cu12/nvidia_nvtx_cu12-12.1.105-py3-none-manylinux1_x86_64.whl (99 kB) Collecting triton==2.2.0 (from torch>=1.11->nvidia-ammo) Downloading triton-2.2.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (1.4 kB) Collecting nvidia-nvjitlink-cu12 (from nvidia-cusolver-cu12==11.4.5.107->torch>=1.11->nvidia-ammo) Using cached https://pypi.nvidia.com/nvidia-nvjitlink-cu12/nvidia_nvjitlink_cu12-12.4.99-py3-none-manylinux2014_x86_64.whl (21.1 MB) Collecting huggingface-hub<1.0,>=0.19.3 (from transformers->nvidia-ammo) Downloading huggingface_hub-0.21.4-py3-none-any.whl.metadata (13 kB) Collecting pyyaml>=5.1 (from transformers->nvidia-ammo) Downloading 
PyYAML-6.0.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (2.1 kB) Collecting regex!=2019.12.17 (from transformers->nvidia-ammo) Downloading regex-2023.12.25-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (40 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 40.9/40.9 kB 12.0 MB/s eta 0:00:00 Collecting requests (from transformers->nvidia-ammo) Downloading requests-2.31.0-py3-none-any.whl.metadata (4.6 kB) Collecting tokenizers<0.19,>=0.14 (from transformers->nvidia-ammo) Downloading tokenizers-0.15.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (6.7 kB) Collecting safetensors>=0.4.1 (from transformers->nvidia-ammo) Downloading safetensors-0.4.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (3.8 kB) Collecting humanfriendly>=9.1 (from coloredlogs->onnxruntime~=1.16.1->nvidia-ammo) Using cached humanfriendly-10.0-py2.py3-none-any.whl.metadata (9.2 kB) Collecting MarkupSafe>=2.0 (from jinja2->torch>=1.11->nvidia-ammo) Using cached MarkupSafe-2.1.5-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (3.0 kB) Collecting charset-normalizer<4,>=2 (from requests->transformers->nvidia-ammo) Downloading charset_normalizer-3.3.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (33 kB) Collecting idna<4,>=2.5 (from requests->transformers->nvidia-ammo) Downloading idna-3.6-py3-none-any.whl.metadata (9.9 kB) Collecting urllib3<3,>=1.21.1 (from requests->transformers->nvidia-ammo) Downloading urllib3-2.2.1-py3-none-any.whl.metadata (6.4 kB) Collecting certifi>=2017.4.17 (from requests->transformers->nvidia-ammo) Downloading certifi-2024.2.2-py3-none-any.whl.metadata (2.2 kB) Collecting mpmath>=0.19 (from sympy->onnxruntime~=1.16.1->nvidia-ammo) Using cached mpmath-1.3.0-py3-none-any.whl.metadata (8.6 kB) Using cached onnx-1.15.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (15.7 MB) Using cached 
onnxruntime-1.16.3-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (6.4 MB)
Downloading numpy-1.26.4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (18.2 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 18.2/18.2 MB 126.7 MB/s eta 0:00:00
Downloading torch-2.2.1-cp310-cp310-manylinux1_x86_64.whl (755.5 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 755.5/755.5 MB 13.5 MB/s eta 0:00:00
Downloading triton-2.2.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (167.9 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 167.9/167.9 MB 45.4 MB/s eta 0:00:00
Using cached networkx-3.2.1-py3-none-any.whl (1.6 MB)
Using cached ninja-1.11.1.1-py2.py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.whl (307 kB)
Using cached scipy-1.12.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (38.4 MB)
Downloading tqdm-4.66.2-py3-none-any.whl (78 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 78.3/78.3 kB 23.5 MB/s eta 0:00:00
Downloading transformers-4.38.2-py3-none-any.whl (8.5 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 8.5/8.5 MB 166.9 MB/s eta 0:00:00
Downloading huggingface_hub-0.21.4-py3-none-any.whl (346 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 346.4/346.4 kB 74.1 MB/s eta 0:00:00
Downloading fsspec-2024.2.0-py3-none-any.whl (170 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 170.9/170.9 kB 59.3 MB/s eta 0:00:00
Downloading packaging-23.2-py3-none-any.whl (53 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 53.0/53.0 kB 16.1 MB/s eta 0:00:00
Using cached protobuf-4.25.3-cp37-abi3-manylinux2014_x86_64.whl (294 kB)
Downloading PyYAML-6.0.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (705 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 705.5/705.5 kB 116.9 MB/s eta 0:00:00
Downloading regex-2023.12.25-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (773 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 774.0/774.0 kB 133.3 MB/s eta 0:00:00
Downloading safetensors-0.4.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.3 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.3/1.3 MB 109.1 MB/s eta 0:00:00
Downloading tokenizers-0.15.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.6 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 3.6/3.6 MB 152.6 MB/s eta 0:00:00
Downloading typing_extensions-4.10.0-py3-none-any.whl (33 kB)
Using cached coloredlogs-15.0.1-py2.py3-none-any.whl (46 kB)
Downloading filelock-3.13.1-py3-none-any.whl (11 kB)
Using cached flatbuffers-23.5.26-py2.py3-none-any.whl (26 kB)
Using cached Jinja2-3.1.3-py3-none-any.whl (133 kB)
Downloading requests-2.31.0-py3-none-any.whl (62 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 62.6/62.6 kB 21.2 MB/s eta 0:00:00
Using cached sympy-1.12-py3-none-any.whl (5.7 MB)
Downloading certifi-2024.2.2-py3-none-any.whl (163 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 163.8/163.8 kB 61.5 MB/s eta 0:00:00
Downloading charset_normalizer-3.3.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (142 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 142.1/142.1 kB 41.7 MB/s eta 0:00:00
Using cached humanfriendly-10.0-py2.py3-none-any.whl (86 kB)
Downloading idna-3.6-py3-none-any.whl (61 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 61.6/61.6 kB 26.7 MB/s eta 0:00:00
Using cached MarkupSafe-2.1.5-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (25 kB)
Using cached mpmath-1.3.0-py3-none-any.whl (536 kB)
Downloading urllib3-2.2.1-py3-none-any.whl (121 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 121.1/121.1 kB 54.1 MB/s eta 0:00:00
Saved ./nvidia_ammo-0.7.4-cp310-cp310-linux_x86_64.whl
Saved ./onnx-1.15.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Saved ./onnxruntime-1.16.3-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Saved ./numpy-1.26.4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Saved ./torch-2.2.1-cp310-cp310-manylinux1_x86_64.whl
Saved ./nvidia_cublas_cu12-12.1.3.1-py3-none-manylinux1_x86_64.whl
Saved ./nvidia_cuda_cupti_cu12-12.1.105-py3-none-manylinux1_x86_64.whl
Saved ./nvidia_cuda_nvrtc_cu12-12.1.105-py3-none-manylinux1_x86_64.whl
Saved ./nvidia_cuda_runtime_cu12-12.1.105-py3-none-manylinux1_x86_64.whl
Saved ./nvidia_cudnn_cu12-8.9.2.26-py3-none-manylinux1_x86_64.whl
Saved ./nvidia_cufft_cu12-11.0.2.54-py3-none-manylinux1_x86_64.whl
Saved ./nvidia_curand_cu12-10.3.2.106-py3-none-manylinux1_x86_64.whl
Saved ./nvidia_cusolver_cu12-11.4.5.107-py3-none-manylinux1_x86_64.whl
Saved ./nvidia_cusparse_cu12-12.1.0.106-py3-none-manylinux1_x86_64.whl
Saved ./nvidia_nccl_cu12-2.19.3-py3-none-manylinux1_x86_64.whl
Saved ./nvidia_nvtx_cu12-12.1.105-py3-none-manylinux1_x86_64.whl
Saved ./triton-2.2.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Saved ./networkx-3.2.1-py3-none-any.whl
Saved ./ninja-1.11.1.1-py2.py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.whl
Saved ./onnx_graphsurgeon-0.3.25-py2.py3-none-any.whl
Saved ./scipy-1.12.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Saved ./tqdm-4.66.2-py3-none-any.whl
Saved ./transformers-4.38.2-py3-none-any.whl
Saved ./huggingface_hub-0.21.4-py3-none-any.whl
Saved ./fsspec-2024.2.0-py3-none-any.whl
Saved ./packaging-23.2-py3-none-any.whl
Saved ./protobuf-4.25.3-cp37-abi3-manylinux2014_x86_64.whl
Saved ./PyYAML-6.0.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Saved ./regex-2023.12.25-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Saved ./safetensors-0.4.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Saved ./tokenizers-0.15.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Saved ./typing_extensions-4.10.0-py3-none-any.whl
Saved ./coloredlogs-15.0.1-py2.py3-none-any.whl
Saved ./filelock-3.13.1-py3-none-any.whl
Saved ./flatbuffers-23.5.26-py2.py3-none-any.whl
Saved ./Jinja2-3.1.3-py3-none-any.whl
Saved ./requests-2.31.0-py3-none-any.whl
Saved ./sympy-1.12-py3-none-any.whl
Saved ./certifi-2024.2.2-py3-none-any.whl
Saved ./charset_normalizer-3.3.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Saved ./humanfriendly-10.0-py2.py3-none-any.whl
Saved ./idna-3.6-py3-none-any.whl
Saved ./MarkupSafe-2.1.5-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Saved ./mpmath-1.3.0-py3-none-any.whl
Saved ./urllib3-2.2.1-py3-none-any.whl
Saved ./nvidia_nvjitlink_cu12-12.4.99-py3-none-manylinux2014_x86_64.whl
Successfully downloaded nvidia-ammo onnx onnxruntime numpy torch nvidia-cublas-cu12 nvidia-cuda-cupti-cu12 nvidia-cuda-nvrtc-cu12 nvidia-cuda-runtime-cu12 nvidia-cudnn-cu12 nvidia-cufft-cu12 nvidia-curand-cu12 nvidia-cusolver-cu12 nvidia-cusparse-cu12 nvidia-nccl-cu12 nvidia-nvtx-cu12 triton networkx ninja onnx-graphsurgeon scipy tqdm transformers huggingface-hub fsspec packaging protobuf pyyaml regex safetensors tokenizers typing-extensions coloredlogs filelock flatbuffers jinja2 requests sympy certifi charset-normalizer humanfriendly idna MarkupSafe mpmath urllib3 nvidia-nvjitlink-cu12
```

Could you try again and share the full log?

pfldy2850 commented 6 months ago

Thank you for your response, @byshiue

I now suspect the following part of the log I posted:

Looking in indexes: https://pypi.org/simple, https://pypi.nvidia.com
WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /tensorrt-llm/

Judging from these lines, I'm guessing that the firewall or network settings in my environment are preventing normal access to the package index.
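To check whether the retries point at one index path or at several, the warnings can be tallied from a saved pip log. The following is a minimal sketch, not part of pip: `summarize_retries` is a hypothetical helper, and the regular expression assumes the warning format shown in the log lines above.

```python
import re

# Assumed format, taken from the warning lines quoted above:
# "WARNING: Retrying (Retry(total=4, ...)) after connection broken by '<cause>': <path>"
RETRY_RE = re.compile(
    r"WARNING: Retrying \(Retry\(total=(?P<left>\d+),.*\)\) "
    r"after connection broken by '(?P<cause>.*)': (?P<path>\S+)"
)


def summarize_retries(log_text):
    """Return {index_path: (warning_count, lowest_retries_left)}."""
    summary = {}
    for line in log_text.splitlines():
        match = RETRY_RE.search(line)
        if match is None:
            continue  # not a retry warning
        path = match.group("path")
        left = int(match.group("left"))
        count, lowest = summary.get(path, (0, left))
        summary[path] = (count + 1, min(lowest, left))
    return summary


if __name__ == "__main__":
    import sys

    # Usage: python summarize_retries.py < pip_install.log
    for path, (count, lowest) in summarize_retries(sys.stdin.read()).items():
        print(f"{path}: {count} retry warning(s), lowest retries left = {lowest}")
```

If every warning names the same path and the counter reaches 0, the connection to that index was reset on every attempt, which would be consistent with a network or firewall problem rather than a packaging problem.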

plt12138 commented 6 months ago

I get the same error when trying to install tensorrt-llm-0.8.0 in Docker:

docker pull nvcr.io/nvidia/tritonserver:24.02-trtllm-python-py3
...
pip install tensorrt_llm-0.8.0-cp310-cp310-linux_x86_64.whl

Error:


Collecting nvidia-ammo~=0.7.0 (from tensorrt-llm==0.8.0)
  Downloading nvidia-ammo-0.7.4.tar.gz (6.9 kB)
  Preparing metadata (setup.py) ... error 
  error: subprocess-exited-with-error

  x python setup.py egg_info did not run successfully.
     exit code: 1
         [6 lines of output]
         Traceback (most recent call last):
            File "<string>", line 2, in <module>
            File "<pip-setuptools-caller>", line 34, in <module>
            File "/tmp/pip-install-kif3gunq/nvidia-ammo_2f5a5762a60446e69eb1c0693b55ac14/setup.py", line 90, in <module>
                raise RuntimeError("Bad params")
         RuntimeError: Bad params
         [end of output]
    note: This error originates from a subprocess, and is likely not a problem with pip.
error: metadata-generation-failed

byshiue commented 5 months ago

Could you try running

pip install -r requirements-dev.txt

pankajroark commented 4 months ago

I ran into this as well and I think I found the root cause. In case someone hits this again: the 0.7.* versions of nvidia-ammo on PyPI (pypi.org) seem to be broken, so it's important to add --extra-index-url https://pypi.nvidia.com. The versions on pypi.nvidia.com seem to work.
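One way to tell the two cases apart in a pip log: in the failing log in this thread, nvidia-ammo 0.7.4 arrived as a source archive (`nvidia-ammo-0.7.4.tar.gz`), while in the successful log it was saved as a wheel (`nvidia_ammo-0.7.4-cp310-cp310-linux_x86_64.whl`). The sketch below classifies what pip fetched for a given package. `artifact_kinds` is a hypothetical helper, and reading "sdist" as "came from pypi.org" is an inference from these two logs, not documented behavior.

```python
import re

# Matches the file name in "Downloading ...", "Saved ./..." and
# "Using cached ..." lines of a pip log.
ARTIFACT_RE = re.compile(
    r"(?:Downloading|Saved|Using cached)\s+(?:\./)?"
    r"(?P<file>\S+?\.(?P<ext>whl|tar\.gz|zip))\b"
)


def _normalize(name):
    # Package names compare equal regardless of "-", "_" or "." separators.
    return re.sub(r"[-_.]+", "-", name).lower()


def artifact_kinds(log_text, package):
    """Return the set of artifact kinds ("wheel" or "sdist") seen for `package`."""
    wanted = _normalize(package)
    kinds = set()
    for match in ARTIFACT_RE.finditer(log_text):
        filename = match.group("file").split("/")[-1]
        # The project name is everything before the first "-<digit>" (the version).
        project = re.split(r"-(?=\d)", filename, maxsplit=1)[0]
        if _normalize(project) == wanted:
            kinds.add("wheel" if match.group("ext") == "whl" else "sdist")
    return kinds


if __name__ == "__main__":
    failing = "  Downloading nvidia-ammo-0.7.4.tar.gz (6.9 kB)\n"
    working = "Saved ./nvidia_ammo-0.7.4-cp310-cp310-linux_x86_64.whl\n"
    print(artifact_kinds(failing, "nvidia-ammo"))
    print(artifact_kinds(working, "nvidia-ammo"))
```

If the result for nvidia-ammo contains "sdist", rerunning the install with `--extra-index-url https://pypi.nvidia.com`, as in the original command at the top of this issue, is the first thing to try.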