Integration of the grid to work with mini_core_accel
Perform dedicated tests to check functionality
I ran a 4x4 matrix multiplication using for loops to measure performance and got a 13x speedup. I can increase the speedup even more: it's easy to increase the systolic array size and get a much larger speedup.
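For reference, here is a minimal sketch of what a for-loop 4x4 matrix multiplication looks like. The actual test code is not shown in these notes, so the function name `matmul_loops` and the sample matrices are assumptions made purely for illustration. The sketch also counts the multiply-accumulate (MAC) operations, which is the work a systolic array would spread across its processing elements.

```python
def matmul_loops(a, b):
    """Multiply two square matrices with plain for loops.

    Returns the product and the number of multiply-accumulate steps.
    Hypothetical helper, not taken from the mini_core_accel test code.
    """
    n = len(a)
    c = [[0] * n for _ in range(n)]
    macs = 0
    for i in range(n):          # row of the result
        for j in range(n):      # column of the result
            for k in range(n):  # dot-product index
                c[i][j] += a[i][k] * b[k][j]
                macs += 1
    return c, macs


# Sample inputs (assumed): multiplying by the identity returns the input.
A = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
I4 = [[1 if i == j else 0 for j in range(4)] for i in range(4)]

C, MACS = matmul_loops(A, I4)
print(C == A, MACS)
```

For an n x n multiplication the loop nest performs n^3 MACs one after another (64 for the 4x4 case), which is why a larger array of parallel processing elements can plausibly give a larger speedup.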