Closed pierrepaleo closed 2 years ago
I suspect it's this code snippet you're alluding to:
Have you tried changing those types to something bigger, say unsigned long
?
Also here:
Thanks @inducer it seems to solve the problem. Should I do a PR ?
Do you think changing these lines is enough to fix this class of problems, i.e, are there other files I should be looking at ?
Yes, I'd be happy to consider a PR. Thanks for offering!
If you're up for it, please look over reduction.py
and scan.py
for related issues.
Describe the bug
It seems that pycuda is not able to use arrays bigger than 17 GB. Allocating (with
gpuarray.empty
orgpuarray.zeros
) works, but any subsequent operation on the array will hang (no crash).To Reproduce
The same goes with a custom
ElementwiseKernel
applied on this array: the operation hangs but does not crash.The limits seems to be
2**34
bytes, meaning thatn_frames = 1023
should work in the above example.Doing the same with a C/Cuda programm works (I can provide a source code if needed).
Tried with the following configurations
Perhaps it has to do with the usage of
int
instead ofunsigned int
orsize_t
, but it looks like pycuda already usesunsigned
type at least in get_elwise_module.