ROCm / hipBLASLt

hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionality beyond that of a traditional BLAS library.
https://rocm.docs.amd.com/projects/hipBLASLt/en/latest/index.html
MIT License

gtest fix for int8_matmul #838

Closed AndySu12 closed 3 months ago

AndySu12 commented 3 months ago

You should enlarge the size rather than make this issue invisible. The solution should be to make the CPU cast formula the same as the GPU cast formula.

Done.
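The thread does not show either cast formula, so the following is only a sketch of how a CPU reference cast and a GPU cast could disagree. It assumes the GPU path rounds to nearest and saturates to the int8 range, while a plain narrowing cast on the CPU wraps. Both `wrap_cast_i8` and `saturate_cast_i8` are hypothetical names and are not taken from hipBLASLt.

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>

// Hypothetical helpers for illustration only; neither is taken from hipBLASLt.

// Plain narrowing cast: on two's-complement targets it keeps only the low
// 8 bits, so an out-of-range accumulator such as 300 becomes 44.
inline int8_t wrap_cast_i8(int32_t v)
{
    return static_cast<int8_t>(v);
}

// Assumed "GPU-style" cast: round to nearest (ties to even under the default
// rounding mode), then clamp to the int8 range [-128, 127].
inline int8_t saturate_cast_i8(float x)
{
    const float rounded = std::nearbyint(x);
    const float clamped = std::min(127.0f, std::max(-128.0f, rounded));
    return static_cast<int8_t>(clamped);
}
```

If the reference used the first form while the kernel behaved like the second, the two results would differ for every out-of-range value, so the comparison only passes once both sides share one formula.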

AndySu12 commented 3 months ago

gfx942 local test passed.

hipBLASLt version: 800

Query device success: there are 8 devices

Device ID 0 : AMD Instinct MI300X gfx942:sramecc+:xnack- with 206.1 GB memory, max. SCLK 2100 MHz, max. MCLK 1300 MHz, compute capability 9.4 maxGridDimX 2147483647, sharedMemPerBlock 65.5 KB, maxThreadsPerBlock 1024, warpSize 64

Device ID 1 : AMD Instinct MI300X gfx942:sramecc+:xnack- with 206.1 GB memory, max. SCLK 2100 MHz, max. MCLK 1300 MHz, compute capability 9.4 maxGridDimX 2147483647, sharedMemPerBlock 65.5 KB, maxThreadsPerBlock 1024, warpSize 64

Device ID 2 : AMD Instinct MI300X gfx942:sramecc+:xnack- with 206.1 GB memory, max. SCLK 2100 MHz, max. MCLK 1300 MHz, compute capability 9.4 maxGridDimX 2147483647, sharedMemPerBlock 65.5 KB, maxThreadsPerBlock 1024, warpSize 64

Device ID 3 : AMD Instinct MI300X gfx942:sramecc+:xnack- with 206.1 GB memory, max. SCLK 2100 MHz, max. MCLK 1300 MHz, compute capability 9.4 maxGridDimX 2147483647, sharedMemPerBlock 65.5 KB, maxThreadsPerBlock 1024, warpSize 64

Device ID 4 : AMD Instinct MI300X gfx942:sramecc+:xnack- with 206.1 GB memory, max. SCLK 2100 MHz, max. MCLK 1300 MHz, compute capability 9.4 maxGridDimX 2147483647, sharedMemPerBlock 65.5 KB, maxThreadsPerBlock 1024, warpSize 64

Device ID 5 : AMD Instinct MI300X gfx942:sramecc+:xnack- with 206.1 GB memory, max. SCLK 2100 MHz, max. MCLK 1300 MHz, compute capability 9.4 maxGridDimX 2147483647, sharedMemPerBlock 65.5 KB, maxThreadsPerBlock 1024, warpSize 64

Device ID 6 : AMD Instinct MI300X gfx942:sramecc+:xnack- with 206.1 GB memory, max. SCLK 2100 MHz, max. MCLK 1300 MHz, compute capability 9.4 maxGridDimX 2147483647, sharedMemPerBlock 65.5 KB, maxThreadsPerBlock 1024, warpSize 64

Device ID 7 : AMD Instinct MI300X gfx942:sramecc+:xnack- with 206.1 GB memory, max. SCLK 2100 MHz, max. MCLK 1300 MHz, compute capability 9.4 maxGridDimX 2147483647, sharedMemPerBlock 65.5 KB, maxThreadsPerBlock 1024, warpSize 64

info: parsing of test data may take a couple minutes before any test output appears...

Note: Google Test filter = *dst_i8*
[==========] Running 64 tests from 1 test suite.
[----------] Global test environment set-up.
[----------] 64 tests from _/matmultest
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_128_128_128_1_128_128_0_128_1281
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_128_128_128_1_128_128_0_128_1281 (13378 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_128_128_128_1_128_128_0_128_128_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_128_128_128_1_128_128_0_128_128_1SAV (67 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_128_128_128_1_128_128_2_128_1281
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_128_128_128_1_128_128_2_128_1281 (65 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_128_128_128_1_128_128_2_128_128_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_128_128_128_1_128_128_2_128_128_1SAV (66 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_131_131_131_1_131_131_0_131_1311
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_131_131_131_1_131_131_0_131_1311 (67 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_131_131_131_1_131_131_0_131_131_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_131_131_131_1_131_131_0_131_131_1SAV (66 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_131_131_131_1_131_131_2_131_1311
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_131_131_131_1_131_131_2_131_1311 (65 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_131_131_131_1_131_131_2_131_131_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_131_131_131_1_131_131_2_131_131_1SAV (66 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_1024_1024_1024_1_1024_1024_0_1024_10241
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_1024_1024_1024_1_1024_1024_0_1024_10241 (652 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_1024_1024_1024_1_1024_1024_0_1024_1024_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_1024_1024_1024_1_1024_1024_0_1024_1024_1SAV (625 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_1024_1024_1024_1_1024_1024_2_1024_10241
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_1024_1024_1024_1_1024_1024_2_1024_10241 (598 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_1024_1024_1024_1_1024_1024_2_1024_1024_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_1024_1024_1024_1_1024_1024_2_1024_1024_1SAV (598 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_1031_1031_1031_1_1031_1031_0_1031_10311
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_1031_1031_1031_1_1031_1031_0_1031_10311 (598 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_1031_1031_1031_1_1031_1031_0_1031_1031_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_1031_1031_1031_1_1031_1031_0_1031_1031_1SAV (623 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_1031_1031_1031_1_1031_1031_2_1031_10311
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_1031_1031_1031_1_1031_1031_2_1031_10311 (592 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_1031_1031_1031_1_1031_1031_2_1031_1031_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NN_1031_1031_1031_1_1031_1031_2_1031_1031_1SAV (596 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_128_128_128_1_128_128_0_128_1281
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_128_128_128_1_128_128_0_128_1281 (27 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_128_128_128_1_128_128_0_128_128_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_128_128_128_1_128_128_0_128_128_1SAV (27 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_128_128_128_1_128_128_2_128_1281
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_128_128_128_1_128_128_2_128_1281 (27 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_128_128_128_1_128_128_2_128_128_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_128_128_128_1_128_128_2_128_128_1SAV (34 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_131_131_131_1_131_131_0_131_1311
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_131_131_131_1_131_131_0_131_1311 (28 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_131_131_131_1_131_131_0_131_131_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_131_131_131_1_131_131_0_131_131_1SAV (41 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_131_131_131_1_131_131_2_131_1311
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_131_131_131_1_131_131_2_131_1311 (27 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_131_131_131_1_131_131_2_131_131_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_131_131_131_1_131_131_2_131_131_1SAV (27 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_1024_1024_1024_1_1024_1024_0_1024_10241
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_1024_1024_1024_1_1024_1024_0_1024_10241 (833 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_1024_1024_1024_1_1024_1024_0_1024_1024_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_1024_1024_1024_1_1024_1024_0_1024_1024_1SAV (830 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_1024_1024_1024_1_1024_1024_2_1024_10241
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_1024_1024_1024_1_1024_1024_2_1024_10241 (801 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_1024_1024_1024_1_1024_1024_2_1024_1024_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_1024_1024_1024_1_1024_1024_2_1024_1024_1SAV (810 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_1031_1031_1031_1_1031_1031_0_1031_10311
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_1031_1031_1031_1_1031_1031_0_1031_10311 (843 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_1031_1031_1031_1_1031_1031_0_1031_1031_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_1031_1031_1031_1_1031_1031_0_1031_1031_1SAV (811 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_1031_1031_1031_1_1031_1031_2_1031_10311
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_1031_1031_1031_1_1031_1031_2_1031_10311 (826 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_1031_1031_1031_1_1031_1031_2_1031_1031_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_NT_1031_1031_1031_1_1031_1031_2_1031_1031_1SAV (854 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_128_128_128_1_128_128_0_128_1281
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_128_128_128_1_128_128_0_128_1281 (33 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_128_128_128_1_128_128_0_128_128_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_128_128_128_1_128_128_0_128_128_1SAV (25 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_128_128_128_1_128_128_2_128_1281
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_128_128_128_1_128_128_2_128_1281 (25 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_128_128_128_1_128_128_2_128_128_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_128_128_128_1_128_128_2_128_128_1SAV (24 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_131_131_131_1_131_131_0_131_1311
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_131_131_131_1_131_131_0_131_1311 (33 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_131_131_131_1_131_131_0_131_131_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_131_131_131_1_131_131_0_131_131_1SAV (33 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_131_131_131_1_131_131_2_131_1311
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_131_131_131_1_131_131_2_131_1311 (25 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_131_131_131_1_131_131_2_131_131_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_131_131_131_1_131_131_2_131_131_1SAV (31 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_1024_1024_1024_1_1024_1024_0_1024_10241
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_1024_1024_1024_1_1024_1024_0_1024_10241 (1048 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_1024_1024_1024_1_1024_1024_0_1024_1024_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_1024_1024_1024_1_1024_1024_0_1024_1024_1SAV (1074 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_1024_1024_1024_1_1024_1024_2_1024_10241
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_1024_1024_1024_1_1024_1024_2_1024_10241 (1056 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_1024_1024_1024_1_1024_1024_2_1024_1024_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_1024_1024_1024_1_1024_1024_2_1024_1024_1SAV (1070 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_1031_1031_1031_1_1031_1031_0_1031_10311
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_1031_1031_1031_1_1031_1031_0_1031_10311 (1067 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_1031_1031_1031_1_1031_1031_0_1031_1031_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_1031_1031_1031_1_1031_1031_0_1031_1031_1SAV (1104 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_1031_1031_1031_1_1031_1031_2_1031_10311
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_1031_1031_1031_1_1031_1031_2_1031_10311 (1064 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_1031_1031_1031_1_1031_1031_2_1031_1031_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TN_1031_1031_1031_1_1031_1031_2_1031_1031_1SAV (1101 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_128_128_128_1_128_128_0_128_1281
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_128_128_128_1_128_128_0_128_1281 (31 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_128_128_128_1_128_128_0_128_128_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_128_128_128_1_128_128_0_128_128_1SAV (28 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_128_128_128_1_128_128_2_128_1281
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_128_128_128_1_128_128_2_128_1281 (26 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_128_128_128_1_128_128_2_128_128_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_128_128_128_1_128_128_2_128_128_1SAV (36 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_131_131_131_1_131_131_0_131_1311
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_131_131_131_1_131_131_0_131_1311 (25 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_131_131_131_1_131_131_0_131_131_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_131_131_131_1_131_131_0_131_131_1SAV (28 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_131_131_131_1_131_131_2_131_1311
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_131_131_131_1_131_131_2_131_1311 (31 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_131_131_131_1_131_131_2_131_131_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_131_131_131_1_131_131_2_131_131_1SAV (24 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_1024_1024_1024_1_1024_1024_0_1024_10241
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_1024_1024_1024_1_1024_1024_0_1024_10241 (3404 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_1024_1024_1024_1_1024_1024_0_1024_1024_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_1024_1024_1024_1_1024_1024_0_1024_1024_1SAV (3444 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_1024_1024_1024_1_1024_1024_2_1024_10241
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_1024_1024_1024_1_1024_1024_2_1024_10241 (3423 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_1024_1024_1024_1_1024_1024_2_1024_1024_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_1024_1024_1024_1_1024_1024_2_1024_1024_1SAV (3437 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_1031_1031_1031_1_1031_1031_0_1031_10311
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_1031_1031_1031_1_1031_1031_0_1031_10311 (1068 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_1031_1031_1031_1_1031_1031_0_1031_1031_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_1031_1031_1031_1_1031_1031_0_1031_1031_1SAV (1080 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_1031_1031_1031_1_1031_1031_2_1031_10311
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_1031_1031_1031_1_1031_1031_2_1031_10311 (1056 ms)
[ RUN ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_1031_1031_1031_1_1031_1031_2_1031_1031_1SAV
[ OK ] /matmul_test.matmul/pre_checkin_matmul_gemm_i8_dst_i8_94x_i8_ri8_ri8_ri8_ri32_r_relu_TT_1031_1031_1031_1_1031_1031_2_1031_1031_1SAV (1105 ms)
[----------] 64 tests from /matmul_test (52666 ms total)

[----------] Global test environment tear-down
[==========] 64 tests from 1 test suite ran. (52668 ms total)
[ PASSED ] 64 tests.
hipBLASLt version: 800

command line: ./clients/staging/hipblaslt-test --gtest_filter=*dst_i8*