Closed s-debadri closed 1 month ago
:memo: TAOS-CI Version: 1.5.20200925. Thank you for submitting PR #2573. Please a submit 1commit/1PR (one commit per one PR) policy to get comments quickly from reviewers. Your PR must pass all verificiation processes of cibot before starting a review process from reviewers. If you are new member to join this project, please read manuals in documentation folder and wiki page. In order to monitor a progress status of your PR in more detail, visit http://ci.nnstreamer.ai/.
:octocat: cibot: @s-debadri, A builder checker could not be completed because one of the checkers is not completed. In order to find out a reason, please go to http://ci.nnstreamer.ai/nntrainer/ci/repo-workers/pr-checker/2573-202405081810420.82360291481018-ba95483e08b45051723aefb4a427b5a83ce3b4d6/.
Anyway, especially for non-SIMD operations (it's not going to be SIMDified because it's in if-statement condition), please note that (N & 0x7)
might be cheaper than (N % 8)
.
Anyway, especially for non-SIMD operations (it's not going to be SIMDified because it's in if-statement condition), please note that
(N & 0x7)
might be cheaper than(N % 8)
.
@s-debadri Could you please apply this idea?
LGTM.
For the readability of novice developers in the future, please leave a comment that you have applied bitwise operators instead of modulos for the performance. Otherwise, novice developers won't understand why in the heck you are using
& 0x...
here.
@myungjoo Added the comments as well. Thanks for your suggestion regarding this change.
Good to go! Thanks!
Changes added in this PR:
K%8 == 0
condition added before calling 1x4 and 1x8 kernels to enhance accuracy.hgemm_noTrans_4x4
beforehgemm_noTrans_1x8
.hgemm_noTrans_1x8
.Self evaluation:
Signed-off-by: Debadri Samaddar s.debadri@samsung.com