Closed wznxn closed 10 years ago
@wznxn Can you post the test code as a gist ?
@pavanky The testing_sgemm program is part of the clMagma download suite from the ICL website; we are currently trying to reproduce this issue here in our labs. I will assign the issue to us to represent that we are working on it.
We believe the bug has been fixed in the latest beta public driver: 13.11 Beta. Users running into this issue should try with the beta driver to confirm resolution of this failure.
AMD has released the GA version of Catalyst 13.12. With this driver release, this issue is fixed.
I tested clmagma1.0.0 and clBLAS with AMD FirePro W8000, and got some weird testing results below.
Both MKL 11.0.2.146 and Acml5.3.0 are used to obtain the same problems.
We are wondering whether it is a driver issue. Have you seen this kind of errors before? Any suggestion? Thanks.
./testing_sgemm
M N K clAmdBlas GFLop/s (sec) CPU GFlop/s (sec) error
1024 1024 1024 335.91 ( 0.01) 50.63 ( 0.04) 1.831055e-04 1280 1280 1280 614.83 ( 0.01) 103.87 ( 0.04) 2.746582e-04 1600 1600 1600 605.87 ( 0.01) 138.04 ( 0.06) 3.433228e-04 2000 2000 2000 1198.67 ( 0.01) 136.84 ( 0.12) 4.577637e-04 2500 2500 2500 1173.58 ( 0.03) 37.49 ( 0.83) 8.087158e-04 3125 3125 3125 1250.90 ( 0.05) 151.47 ( 0.40) 1.068115e-03 3906 3906 3906 1226.68 ( 0.10) 166.15 ( 0.72) 1.312256e-03 4882 4882 4882 1250.68 ( 0.19) 194.26 ( 1.20) nan 6102 6102 6102 1260.94 ( 0.36) 217.01 ( 2.09) nan
./testing_dgemm
M N K clAmdBlas GFLop/s (sec) CPU GFlop/s (sec) error
1024 1024 1024 210.07 ( 0.01) 39.60 ( 0.05) 3.126388e-13 1280 1280 1280 334.66 ( 0.01) 55.77 ( 0.08) 5.115908e-13 1600 1600 1600 501.54 ( 0.02) 36.27 ( 0.23) 6.536993e-13 2000 2000 2000 505.43 ( 0.03) 71.73 ( 0.22) 8.242296e-13 2500 2500 2500 482.68 ( 0.06) 83.53 ( 0.37) nan 3125 3125 3125 431.77 ( 0.14) 94.39 ( 0.65) nan 3906 3906 3906 424.89 ( 0.28) 100.95 ( 1.18) nan 4882 4882 4882 419.76 ( 0.55) 112.94 ( 2.06) nan 6102 6102 6102 422.29 ( 1.08) 128.65 ( 3.53) 5.627498e-12
./testing_sgesv_gpu
N NRHS GPU GFlop/s (sec) ||B - AX|| / ||A||*||X||
1024 100 31.37 ( 0.03) 7.35e-08 2048 100 77.00 ( 0.09) 1.35e-07 3072 100 130.27 ( 0.16) 2.02e-07 4032 100 214.66 ( 0.22) 2.27e-07 5184 100 270.23 ( 0.36) nan magma_sposv had error 257. 6384 100 308.80 ( 0.59) nan magma_sposv had error 195. 8385 100 434.03 ( 0.94) nan
According to clinfo, Max memory allocation is 1073741824. Global memory size is 3221225472.
From dmesg: [fglrx] module loaded - fglrx 12.10.5 [May 30 2013]