Closed huu4ontocord closed 3 years ago
Hi,
Indeed the error is cryptic, but the issue is that using 12 heads with 64 value dimensions means that the feature dimension of x should be 12*64 = 768 and not 128. If you change the feature dimension, it should run fine.
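To make the arithmetic concrete, here is a minimal sketch in plain Python of the shape check implied above. The helper names `expected_feature_dim` and `check_input_shape` are hypothetical and not part of the library. The sketch assumes only what is stated here: the last dimension of x must equal the number of heads times the value dimensions.

```python
def expected_feature_dim(n_heads: int, value_dimensions: int) -> int:
    """Feature size the input needs: one value vector per head, concatenated."""
    return n_heads * value_dimensions


def check_input_shape(shape, n_heads: int, value_dimensions: int) -> int:
    """Hypothetical pre-flight check; raises a readable error instead of a cuBLAS one.

    `shape` is the shape of x, assumed here to be (batch, sequence, features).
    """
    expected = expected_feature_dim(n_heads, value_dimensions)
    actual = shape[-1]
    if actual != expected:
        raise ValueError(
            f"x has feature dimension {actual}, but {n_heads} heads with "
            f"{value_dimensions} value dimensions require {expected}"
        )
    return expected


# 12 heads with 64 value dimensions: a 768-wide input passes the check.
print(check_input_shape((10, 100, 768), n_heads=12, value_dimensions=64))

# A 128-wide input, as in the report, is rejected with a clear message.
try:
    check_input_shape((10, 100, 128), n_heads=12, value_dimensions=64)
except ValueError as err:
    print(err)
```

Running a check like this on `x.shape` before the forward pass would surface the mismatch as an ordinary Python exception rather than a CUDA error.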
If you continue to encounter problems feel free to reopen the issue or open a new one.
Cheers, Angelos
This code from the example notebook produces a CUDA error when changing the number of heads, dimensions, etc. Running in Colab:
RuntimeError: CUDA error: CUBLAS_STATUS_INVALID_VALUE when calling `cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)`