ucb-bar / gemmini-rocc-tests

Fork of seldridge/rocket-rocc-examples with tests for a systolic array based matmul accelerator

CPU Matmul: factor out stride computation outside loops #16

Closed pranav-prakash closed 3 years ago

pranav-prakash commented 3 years ago

Even in the old version, the compiler performed loop unswitching for us, so the strides were computed only once inside the j-loop (and with branch prediction the effect was negligible anyway). Nonetheless, this version is cleaner, since it removes the branching entirely.
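For readers without the diff open, here is a minimal sketch of the idea, not the actual code from this PR. The function name `matmul_cpu_sketch`, its signature, the `int32_t` element type, and the `*_step_*` variable names are all assumptions made for illustration. The transpose flags are resolved into per-index steps once, before the loops, so the loop body contains no branches.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical CPU reference matmul computing C = op(A) * op(B).
 * The name, signature, and element type are illustrative assumptions,
 * not the repository's actual API. All matrices are row-major. */
static void matmul_cpu_sketch(bool trans_a, bool trans_b,
                              size_t dim_i, size_t dim_j, size_t dim_k,
                              const int32_t *A, const int32_t *B, int32_t *C,
                              size_t stride_a, size_t stride_b, size_t stride_c)
{
    /* Resolve the transpose flags once, outside every loop.
     * If A is stored transposed (K x I), stepping i moves by 1 element
     * and stepping k moves by a full row; otherwise the reverse. */
    const size_t a_step_i = trans_a ? 1 : stride_a;
    const size_t a_step_k = trans_a ? stride_a : 1;
    const size_t b_step_k = trans_b ? 1 : stride_b;
    const size_t b_step_j = trans_b ? stride_b : 1;

    for (size_t i = 0; i < dim_i; i++) {
        for (size_t j = 0; j < dim_j; j++) {
            int32_t sum = 0;
            /* Branch-free inner loop: only multiplies and adds. */
            for (size_t k = 0; k < dim_k; k++) {
                sum += A[i * a_step_i + k * a_step_k] *
                       B[k * b_step_k + j * b_step_j];
            }
            C[i * stride_c + j] = sum;
        }
    }
}
```

In the branching form, the `trans_a ? ... : ...` choice would sit inside the loop body and rely on the compiler (loop unswitching) or the branch predictor to make it cheap. Here the choice is made exactly once regardless of optimization level.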