nnstreamer / nntrainer

NNtrainer is Software Framework for Training Neural Network Models on Devices.
Apache License 2.0
144 stars 73 forks source link

[ hgemm ] Support beta case @open sesame 06/04 13:13 #2585

Closed skykongkong8 closed 3 months ago

skykongkong8 commented 4 months ago

// redundant case 1 tmp = A.dot(B); C.multiply_i(scalar); C = tmp.add(C);

// use beta C = A.dot(B, C, transA , transB, scalar);

// redundant case 2 C = A.dot(B); C.add_i(scalar);

// use beta C = C.ones(); C = A.dot(B, C, transA , transB, scalar);


- In addition add zero-init code for beta = 0.F case. According to recent model profiling result, even for initialization, minimizing instruction is quite helpful for overall model latency reduction.

**Self evaluation:**
1. Build test:     [X]Passed [ ]Failed [ ]Skipped
2. Run test:     [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: skykongkong8 <ss.kong@samsung.com>
taos-ci commented 4 months ago

:memo: TAOS-CI Version: 1.5.20200925. Thank you for submitting PR #2585. Please a submit 1commit/1PR (one commit per one PR) policy to get comments quickly from reviewers. Your PR must pass all verificiation processes of cibot before starting a review process from reviewers. If you are new member to join this project, please read manuals in documentation folder and wiki page. In order to monitor a progress status of your PR in more detail, visit http://ci.nnstreamer.ai/.