Open CCMK opened 5 months ago
Hi Artyom, I saw this question, and I hope the following saves you some time. Pow should be a piece of cake. For arange, there is some GPU source you can use as a reference: the host code is at https://sourceforge.net/p/octave-ocl/code/ci/default/tree/src/ocl_array.cc#l825 and the kernels are at https://sourceforge.net/p/octave-ocl/code/ci/default/tree/src/ocl_array_prog.cc#l34 (The author is Matthias W. Klein; I believe he is German. We were in contact by email. Like you, we are all busy with real-world work. I was active in testing it and patching it for AMD and Apple Silicon platforms: https://sourceforge.net/projects/octave-ocl-extra/files/) Meanwhile, for the Octave CPU version, see line 631 of https://docs.octave.org/doxygen/4.4/d8/d00/idx-vector_8h_source.html
Best wishes, Jinchuan
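To illustrate why both operations map easily onto a GPU, here is a minimal sketch in plain Python. It is not taken from the octave-ocl sources linked above, and every function name in it is hypothetical. The point is only that each output element of arange depends on nothing but its own index, which is how a kernel with one work-item per element would compute it, and that pow is a plain elementwise map.

```python
import math


def arange_length(start, stop, step):
    """Number of elements in the half-open range [start, stop) with the given step."""
    if step == 0:
        raise ValueError("step must be non-zero")
    # A range that runs the wrong way (e.g. start > stop with a positive step) is empty.
    return max(0, math.ceil((stop - start) / step))


def arange_element(start, step, i):
    """Value at index i; this is the work a single GPU work-item would do."""
    return start + i * step


def arange(start, stop, step=1):
    """Build the full range by evaluating every index independently."""
    n = arange_length(start, stop, step)
    return [arange_element(start, step, i) for i in range(n)]


def elementwise_pow(values, exponent):
    """Raise each element to the same exponent, one independent operation per element."""
    return [v ** exponent for v in values]
```

On a device, the list comprehension would be replaced by a kernel launch over `arange_length(...)` work-items, with the index `i` coming from the global work-item ID. Computing `start + i * step` directly, rather than accumulating `step` repeatedly, also avoids build-up of floating-point rounding error.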
I am currently using OpenCL to test LLMs with fewer than 2B parameters on my small computer, but it seems some important functions are not yet supported.