Closed skykongkong8 closed 6 months ago
:memo: TAOS-CI Version: 1.5.20200925. Thank you for submitting PR #2541. Please a submit 1commit/1PR (one commit per one PR) policy to get comments quickly from reviewers. Your PR must pass all verificiation processes of cibot before starting a review process from reviewers. If you are new member to join this project, please read manuals in documentation folder and wiki page. In order to monitor a progress status of your PR in more detail, visit http://ci.nnstreamer.ai/.
This commit proposes a 8x16 kernel for Half-precision GEMM Note that this is not an '100%' optimized version of HGEMM, but still better than before. Following is unittest output with f16-f32 partial accumulated HGEMM. Fine accuracy with better latency.