Closed park12sj closed 7 months ago
cpu memory? 64GB may need when compile.
@Tlntin
Oh, it's 50GB... I'll go over 64GB and try again. I wonder if you have any opinions about the nvidia-ammo installation error.
I wonder if you have any opinions about the nvidia-ammo installation error.
what's your python version? nvidia-ammo only support python 3.10.
@Tlntin
I am using 3.10 version.
root@personal-sangjune-trt-llm-0:/workspace/storage/cephfs-personal/git/pai/paip-TensorRT-LLM# python --version
Python 3.10.12
I'm trying to do a whl build in the container for build according to the guide https://github.com/NVIDIA/TensorRT-LLM/blob/main/docs/source/build_from_source.md#on-systems-without-gnu-make
oh, maybe other error, i will watch nvidia ammo setup.py source code after a moment.
maybe you can try install nvidia-ammo with this command
pip install --no-cache-dir --extra-index-url https://pypi.nvidia.com nvidia-ammo~=0.5.0
when you install nvidia-ammo, the torch, tensorrt-llm, mpi4py must have installed.
Thank you for the guidance.
By the way, nvidia-ammo is a requirement for tensorrt-llm, when installing nvidia-ammo, if tensorrt-llm is required to be installed, is there no dependency between each other? https://github.com/NVIDIA/TensorRT-LLM/blob/c89653021e66ca78c55f02b366f404455bc12e8d/requirements.txt#L18
Thank you for the guidance.
By the way, nvidia-ammo is a requirement for tensorrt-llm, when installing nvidia-ammo, if tensorrt-llm is required to be installed, is there no dependency between each other?
This may be a bug, in the previous 0.7.0/0.6.0/0.5.0 versions, tensorrt-llm was installed first, and then nvidia-ammo. I haven't tried the compilation of 0.7.1 though, so I can't be sure yet.
Thank you for your answer. I'll close the issue.
System Info
+---------------------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | |=======================================================================================| | No running processes found | +---------------------------------------------------------------------------------------+
Collecting nvidia-ammo~=0.5.0 (from -r requirements.txt (line 18)) Downloading nvidia-ammo-0.5.1.tar.gz (6.9 kB) Preparing metadata (setup.py) ... error error: subprocess-exited-with-error
× python setup.py egg_info did not run successfully. │ exit code: 1 ╰─> [6 lines of output] Traceback (most recent call last): File "", line 2, in
File "", line 34, in
File "/tmp/pip-install-cwksnwhn/nvidia-ammo_375eeb4e0f7248fea8e87aaffc9f0eec/setup.py", line 90, in
raise RuntimeError("Bad params")
RuntimeError: Bad params
[end of output]
note: This error originates from a subprocess, and is likely not a problem with pip. error: metadata-generation-failed
[ 93%] Building CUDA object tensorrt_llm/kernels/CMakeFiles/kernels_src.dir/decoderMaskedMultiheadAttention/decoderMaskedMultiheadAttention64_half.cu.o [ 93%] Building CUDA object tensorrt_llm/kernels/CMakeFiles/kernels_src.dir/decoderMaskedMultiheadAttention/decoderMaskedMultiheadAttention80_bf16.cu.o [ 93%] Building CUDA object tensorrt_llm/kernels/CMakeFiles/kernels_src.dir/decoderMaskedMultiheadAttention/decoderMaskedMultiheadAttention80_float.cu.o [ 93%] Building CUDA object tensorrt_llm/kernels/CMakeFiles/kernels_src.dir/decoderMaskedMultiheadAttention/decoderMaskedMultiheadAttention80_half.cu.o [ 93%] Building CUDA object tensorrt_llm/kernels/CMakeFiles/kernels_src.dir/decoderMaskedMultiheadAttention/decoderMaskedMultiheadAttention96_bf16.cu.o [ 93%] Building CUDA object tensorrt_llm/kernels/CMakeFiles/kernels_src.dir/decoderMaskedMultiheadAttention/decoderMaskedMultiheadAttention96_float.cu.o [ 93%] Building CUDA object tensorrt_llm/kernels/CMakeFiles/kernels_src.dir/decoderMaskedMultiheadAttention/decoderMaskedMultiheadAttention96_half.cu.o nvcc error : 'ptxas' died due to signal 9 (Kill signal) gmake[3]: [tensorrt_llm/kernels/CMakeFiles/kernels_src.dir/build.make:8149: tensorrt_llm/kernels/CMakeFiles/kernels_src.dir/decoderMaskedMultiheadAttention/decoderMaskedMultiheadAttention144_half.cu.o] Error 9 gmake[3]: Waiting for unfinished jobs.... nvcc error : 'ptxas' died due to signal 9 (Kill signal) gmake[3]: [tensorrt_llm/kernels/CMakeFiles/kernels_src.dir/build.make:8164: tensorrt_llm/kernels/CMakeFiles/kernels_src.dir/decoderMaskedMultiheadAttention/decoderMaskedMultiheadAttention160_bf16.cu.o] Error 9 gmake[2]: [CMakeFiles/Makefile2:865: tensorrt_llm/kernels/CMakeFiles/kernels_src.dir/all] Error 2 gmake[1]: [CMakeFiles/Makefile2:790: tensorrt_llm/CMakeFiles/tensorrt_llm.dir/rule] Error 2 gmake: [Makefile:192: tensorrt_llm] Error 2 Traceback (most recent call last): File "/workspace/storage/cephfs-personal/git/pai/paip-TensorRT-LLM/./scripts/build_wheel.py", line 306, in
main(**vars(args))
File "/workspace/storage/cephfs-personal/git/pai/paip-TensorRT-LLM/./scripts/build_wheel.py", line 164, in main
build_run(
File "/usr/lib/python3.10/subprocess.py", line 526, in run
raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command 'cmake --build . --config Release --parallel 40 --target tensorrt_llm tensorrt_llm_static nvinfer_plugin_tensorrt_llm th_common bindings ' returned non-zero exit status 2.