-
I am encountering a segmentation fault while running a DGEMM program with AMD BLIS. I am using the Flang compiler with AOCC and linking against the BLIS library. The program fails during the DGEMM op…
-
I have a thousand of tasks in parallel, each task has two steps:
1. matrix multiplication, C[i] = A[i]*B[i], the matrix sizes are non-uniform, and (m, n, k) is in range 10 ~1024.
2. some oper…
-
Support Vector Machines (SVM) is a supervised machine learning algorithm for classification and regression tasks. SVM works by finding the optimal hyperplane that separates data points of different cl…
-
The basecases for functions like ``nmod_mat_mul``, ``fmpz_mat_mul``, ``nmod_poly_mul`` and ``fmpz_poly_mul`` can be sped up significantly for half-word-size entries (and maybe bigger entries with more…
-
```
Optimize arithmetic operations in the MatVec function, especially the
middle one (Arithm3).
```
Original issue reported on code.google.com by `yurkin` on 26 Nov 2008 at 8:32
-
```
In chapter 21, the example proposed is a "multiplication of A times B with the
result added into C" (p.499)
This function is then presented as C = C + A * B throughout the chapter.
However, it s…
-
Follow [this tutorial](http://gpgpu-computing4.blogspot.de/2009/10/matrix-multiplication-3-opencl.html) for a first matrix multiplication with opencl.
-
The matrix \* vector multiplication is incorrect, effectively treating the matrix as row major when it is column major.
For example `for (int j = 0; j < N; j++) sum += a[i][j] * b[j];` should be `for…
deanm updated
8 years ago
-
Hey @tgolsson!
The AVX512F question is back on the radar with another team evaluating performance on this architecture. IIRC, you have already done the heavy lifting on this front, with decent resu…
-
```
In generic matrix multiplication, parameterize the operator. The operands
must match specific dimension requirements to have a valid matrix
multiplication.
For self-multiplication (operator*=),…