codeplaysoftware / portBLAS

An implementation of BLAS using the SYCL open standard.
Apache License 2.0

Refactor level1 nrm2 operator #466

Closed s-Nick closed 1 year ago

s-Nick commented 1 year ago

This PR refactors the implementation of the nrm2 operator.

These changes are based on the work done for the asum operator, which was merged in PR #458. The nrm2 implementation now shares the same structure as asum, adjusting the execution call to its needs. Each architecture backend file is also updated for nrm2 to manage the different kernel sizes; these values were set empirically using the available hardware.
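
For readers unfamiliar with why asum and nrm2 can share one structure, here is a minimal host-side sketch in plain C++. It is not the portBLAS code: `KernelConfig`, `blocked_reduce`, `asum_sketch`, and `nrm2_sketch` are hypothetical names, and `local_size` only stands in for the per-backend kernel sizes mentioned above. It also skips the scaling a production nrm2 would typically use to avoid overflow.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical tuning knob, standing in for a per-backend kernel size.
struct KernelConfig {
  std::size_t local_size;  // elements accumulated per block (illustrative)
};

// Shared skeleton: transform each element, accumulate one partial sum per
// block, combine the partials, then apply a final operation to the total.
template <typename T, typename ElemOp, typename FinalOp>
T blocked_reduce(const std::vector<T>& x, KernelConfig cfg, ElemOp elem_op,
                 FinalOp final_op) {
  assert(cfg.local_size > 0);
  T total = T(0);
  for (std::size_t start = 0; start < x.size(); start += cfg.local_size) {
    const std::size_t end = std::min(start + cfg.local_size, x.size());
    T partial = T(0);
    for (std::size_t i = start; i < end; ++i) {
      partial += elem_op(x[i]);
    }
    total += partial;
  }
  return final_op(total);
}

// asum: sum of absolute values, no final transformation.
template <typename T>
T asum_sketch(const std::vector<T>& x, KernelConfig cfg) {
  return blocked_reduce(
      x, cfg, [](T v) { return std::abs(v); }, [](T s) { return s; });
}

// nrm2: sum of squares followed by a square root (naive, unscaled form).
template <typename T>
T nrm2_sketch(const std::vector<T>& x, KernelConfig cfg) {
  return blocked_reduce(
      x, cfg, [](T v) { return v * v; }, [](T s) { return std::sqrt(s); });
}
```

In this sketch the two operators differ only in the per-element and final operations, while the block size is passed in separately, which is roughly the kind of split that lets a backend tune sizes without touching the operator logic.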