Open jeffhammond opened 7 months ago
I don't know if it is actually pointless, but it is the variant of GEADD that was originally implemented by ATLAS. (Fun with names of non-standardized BLAS extensions, I guess the most feature-rich GEADD is the one in IBM's ESSL ? Also patches welcome...)
i'll try to work on a patch.
i have not found a single application that uses ?GEADD
, but i know that lots of code isn't publicly available.
None of the codes available to me seem to use it, neither does anything public on github AFAICT, there never was any complaint about the lack of transpositions here (or about the lack of GEADD in the reference implementation either), so maybe it might as well not exist ? :)
?GEADD
has virtually no utility over?AXPBY
, which is itself unoptimizable (i.e. the equivalent loops, optimized by a compiler, will perform at least as well in all cases).Both
appleblas_?geadd
andcublas?geam
support transposition, which is useful, in part because transposition is nontrivial to optimize.cblas_?omatcopy
, which is inspired by MKL, has transpose but does not support the generality of accumulation of the former.It would be nice to have a GEAM-style routine in OpenBLAS.