geerlingguy / top500-benchmark

Automated Top500 benchmark for clusters or single nodes.
MIT License
190 stars 18 forks source link

Benchmark LattePanda Mu #30

Closed geerlingguy closed 6 months ago

geerlingguy commented 6 months ago

As the title says... see https://github.com/geerlingguy/sbc-reviews/issues/42

This system uses a single Intel N100 CPU with unlocked TDP thanks to an aluminum active cooler.

geerlingguy commented 6 months ago
Screenshot 2024-05-04 at 8 39 40 PM

Power usage during the benchmark averages 26W, with peaks to 27W.

geerlingguy commented 6 months ago

63.225 Gflops at 26W, giving 2.43 Gflops/W

================================================================================
HPLinpack 2.3  --  High-Performance Linpack benchmark  --   December 2, 2018
Written by A. Petitet and R. Clint Whaley,  Innovative Computing Laboratory, UTK
Modified by Piotr Luszczek, Innovative Computing Laboratory, UTK
Modified by Julien Langou, University of Colorado Denver
================================================================================

An explanation of the input/output parameters follows:
T/V    : Wall time / encoded variant.
N      : The order of the coefficient matrix A.
NB     : The partitioning blocking factor.
P      : The number of process rows.
Q      : The number of process columns.
Time   : Time in seconds to solve the linear system.
Gflops : Rate of execution for solving the linear system.

The following parameter values will be used:

N      :   23314
NB     :     256
PMAP   : Row-major process mapping
P      :       1
Q      :       4
PFACT  :   Right
NBMIN  :       4
NDIV   :       2
RFACT  :   Crout
BCAST  :  1ringM
DEPTH  :       1
SWAP   : Mix (threshold = 64)
L1     : transposed form
U      : transposed form
EQUIL  : yes
ALIGN  : 8 double precision words

--------------------------------------------------------------------------------

- The matrix A is randomly generated for each test.
- The following scaled residual check will be computed:
      ||Ax-b||_oo / ( eps * ( || x ||_oo * || A ||_oo + || b ||_oo ) * N )
- The relative machine precision (eps) is taken to be               1.110223e-16
- Computational tests pass if scaled residuals are less than                16.0

================================================================================
T/V                N    NB     P     Q               Time                 Gflops
--------------------------------------------------------------------------------
WR11C2R4       23314   256     1     4             133.63             6.3225e+01
HPL_pdgesv() start time Sat May  4 20:38:30 2024

HPL_pdgesv() end time   Sat May  4 20:40:44 2024

--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)=   3.29349238e-03 ...... PASSED
================================================================================

Finished      1 tests with the following results:
              1 tests completed and passed residual checks,
              0 tests completed and failed residual checks,
              0 tests skipped because of illegal input values.
--------------------------------------------------------------------------------

End of Tests.
================================================================================
geerlingguy commented 6 months ago

Re-tested today because I realized I had a 1TB USB flash drive plugged in during the benchmark, sucking down like 1W extra.

It's closer to 25W average, and I got 62.851 Gflops, across two runs, for 2.51 Gflops/W