nnstreamer / nntrainer

NNtrainer is Software Framework for Training Neural Network Models on Devices.
Apache License 2.0
144 stars 73 forks source link

Using Bfloat16 GEMM from OpenBlas #2668

Closed skykongkong8 closed 2 months ago

skykongkong8 commented 3 months ago

It seems latest OpenBlas supports bfloat16 GEMM. I guess upgrading openblas version from here will simply bring their functions to the NNTrainer (0.3.18 -> 0.3.24)

1. hardware compatibility

2. accuracy

3. latency measurement

4. note

Bfloat16 is more robust to inf / NaN, which can be useful to mixed precision training and fp16fp32 accumulation.

taos-ci commented 3 months ago

:octocat: cibot: Thank you for posting issue #2668. The person in charge will reply soon.

skykongkong8 commented 2 months ago

It seems introducing this function in the current NNTrainer is not appropriate, and almost every participant in the project is aware of this issue. This issue can be raised again when the proper moment to introduce such functions has come.