Closed radna0 closed 1 month ago
NAME="Ubuntu" VERSION="22.04.4 LTS (Jammy Jellyfish)" CPU: model name : AMD Ryzen 5 3600 6-Core Processor GPU: Name: AMD Ryzen 5 3600 6-Core Processor Marketing Name: AMD Ryzen 5 3600 6-Core Processor Name: gfx900 Marketing Name: Radeon RX Vega Name: amdgcn-amd-amdhsa--gfx900:xnack-
(base) r4-0@r40-desktop:~/triton-triton-mlir/python$ MAX_JOBS=8 pip3 install -e . Obtaining file:///home/r4-0/triton-triton-mlir/python Installing build dependencies ... done Checking if build backend supports build_editable ... done Getting requirements to build editable ... done Preparing editable metadata (pyproject.toml) ... done Requirement already satisfied: filelock in /home/r4-0/miniconda3/lib/python3.12/site-packages (from triton==2.1.0) (3.14.0) Building wheels for collected packages: triton Building editable for triton (pyproject.toml) ... -
NAME="Ubuntu" VERSION="22.04.4 LTS (Jammy Jellyfish)"
AMD Ryzen 5 3600 6-Core Processor
AMD Radeon RX Vega
ROCm 6.0.0
No response
git clone https://github.com/ROCmSoftwarePlatform/triton.git cd triton git checkout triton-mlir cd python pip3 install ninja cmake; # build time dependencies
pip3 install -e .
(base) r4-0@r40-desktop:~$ /opt/rocm/bin/rocminfo --support ROCk module is loaded ===================== HSA System Attributes ===================== Runtime Version: 1.1 System Timestamp Freq.: 1000.000000MHz Sig. Max Wait Duration: 18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count) Machine Model: LARGE System Endianness: LITTLE Mwaitx: DISABLED DMAbuf Support: YES ========== HSA Agents ========== ******* Agent 1 ******* Name: AMD Ryzen 5 3600 6-Core Processor Uuid: CPU-XX Marketing Name: AMD Ryzen 5 3600 6-Core Processor Vendor Name: CPU Feature: None specified Profile: FULL_PROFILE Float Round Mode: NEAR Max Queue Number: 0(0x0) Queue Min Size: 0(0x0) Queue Max Size: 0(0x0) Queue Type: MULTI Node: 0 Device Type: CPU Cache Info: L1: 32768(0x8000) KB Chip ID: 0(0x0) ASIC Revision: 0(0x0) Cacheline Size: 64(0x40) Max Clock Freq. (MHz): 3600 BDFID: 0 Internal Node ID: 0 Compute Unit: 12 SIMDs per CU: 0 Shader Engines: 0 Shader Arrs. per Eng.: 0 WatchPts on Addr. Ranges:1 Features: None Pool Info: Pool 1 Segment: GLOBAL; FLAGS: FINE GRAINED Size: 32779452(0x1f42cbc) KB Allocatable: TRUE Alloc Granule: 4KB Alloc Alignment: 4KB Accessible by all: TRUE Pool 2 Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED Size: 32779452(0x1f42cbc) KB Allocatable: TRUE Alloc Granule: 4KB Alloc Alignment: 4KB Accessible by all: TRUE Pool 3 Segment: GLOBAL; FLAGS: COARSE GRAINED Size: 32779452(0x1f42cbc) KB Allocatable: TRUE Alloc Granule: 4KB Alloc Alignment: 4KB Accessible by all: TRUE ISA Info: ******* Agent 2 ******* Name: gfx900 Uuid: GPU-021501aee71248a4 Marketing Name: Radeon RX Vega Vendor Name: AMD Feature: KERNEL_DISPATCH Profile: BASE_PROFILE Float Round Mode: NEAR Max Queue Number: 128(0x80) Queue Min Size: 64(0x40) Queue Max Size: 131072(0x20000) Queue Type: MULTI Node: 1 Device Type: GPU Cache Info: L1: 16(0x10) KB L2: 4096(0x1000) KB Chip ID: 26751(0x687f) ASIC Revision: 1(0x1) Cacheline Size: 64(0x40) Max Clock Freq. (MHz): 1630 BDFID: 2304 Internal Node ID: 1 Compute Unit: 64 SIMDs per CU: 4 Shader Engines: 4 Shader Arrs. per Eng.: 1 WatchPts on Addr. Ranges:4 Coherent Host Access: FALSE Features: KERNEL_DISPATCH Fast F16 Operation: TRUE Wavefront Size: 64(0x40) Workgroup Max Size: 1024(0x400) Workgroup Max Size per Dimension: x 1024(0x400) y 1024(0x400) z 1024(0x400) Max Waves Per CU: 40(0x28) Max Work-item Per CU: 2560(0xa00) Grid Max Size: 4294967295(0xffffffff) Grid Max Size per Dimension: x 4294967295(0xffffffff) y 4294967295(0xffffffff) z 4294967295(0xffffffff) Max fbarriers/Workgrp: 32 Packet Processor uCode:: 469 SDMA engine uCode:: 434 IOMMU Support:: None Pool Info: Pool 1 Segment: GLOBAL; FLAGS: COARSE GRAINED Size: 8372224(0x7fc000) KB Allocatable: TRUE Alloc Granule: 4KB Alloc Alignment: 4KB Accessible by all: FALSE Pool 2 Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED Size: 8372224(0x7fc000) KB Allocatable: TRUE Alloc Granule: 4KB Alloc Alignment: 4KB Accessible by all: FALSE Pool 3 Segment: GROUP Size: 64(0x40) KB Allocatable: FALSE Alloc Granule: 0KB Alloc Alignment: 0KB Accessible by all: FALSE ISA Info: ISA 1 Name: amdgcn-amd-amdhsa--gfx900:xnack- Machine Models: HSA_MACHINE_MODEL_LARGE Profiles: HSA_PROFILE_BASE Default Rounding Mode: NEAR Default Rounding Mode: NEAR Fast f16: TRUE Workgroup Max Size: 1024(0x400) Workgroup Max Size per Dimension: x 1024(0x400) y 1024(0x400) z 1024(0x400) Grid Max Size: 4294967295(0xffffffff) Grid Max Size per Dimension: x 4294967295(0xffffffff) y 4294967295(0xffffffff) z 4294967295(0xffffffff) FBarrier Max Size: 32 *** Done ***
We don't support gfx900 any more.
Problem Description
Triton not building/stuck
INFO:
Operating System
NAME="Ubuntu" VERSION="22.04.4 LTS (Jammy Jellyfish)"
CPU
AMD Ryzen 5 3600 6-Core Processor
GPU
AMD Radeon RX Vega
ROCm Version
ROCm 6.0.0
ROCm Component
No response
Steps to Reproduce
git clone https://github.com/ROCmSoftwarePlatform/triton.git cd triton git checkout triton-mlir cd python pip3 install ninja cmake; # build time dependencies
Stuck/Not Building
pip3 install -e .
(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support
Additional Information
No response