Closed FabianSchuetze closed 1 day ago
I finally figured it out.
The library needs to be built with the additional option benchmark_examples
and the test is run on the device with:
./benchmark_neon_sgemm --iterations=100 --example_args=2048,2048,2048
Thanks for the wonderful library. Apologies if this seems to be a silly question:
How can I benchmark gemms on a android target?
In line with the docs for test I run the following on my target (including output):
However, only
Scale
benchmarks seem to be run.I am interested in running Int8 GEMMS with (Int 32 accumulator) and obtain the GFLOPS/sec my target supports. I would like to use all cores on my system. I would best like to test the
SMMLA
(UMMLA
) instructions.I build the
arm_compute_benchmark
binary with the following command:I also had to slightly modify the
Sconstruct
file, the patch is below ( a bit hacky, but I'm only interest in cross-compilation):