Closed tjk9501 closed 1 year ago
There is warm up time. CuPy does runtime compilation. Would suggest running several times (like with timeit
). Otherwise this is mainly capturing the compilation time
Hello, jakirkham: I have another interesting question, for instance, if we have a NVIDIA GPU with Graphical memory being 10 GB, and we create two matrices A and B, with each's size being 5GB and 6GB of the GPU Memory. Then we want to perform matrix multiplication of C = dot(A,B) with cupy dot function, what will happen if we use Cupy Framework? Will Cupy report out of GPU memory or will it allocate GPU memory to complete calculation automatically?
Maybe this discussion should continue over in a CuPy issue or perhaps the CuPy Gitter channel since this isn't really about the CuPy Conda package?
Indeed. CuPy already documented answers for your first question: https://docs.cupy.dev/en/stable/user_guide/performance.html As for the second question, you'd likely get OOM. Please use the official channels for CuPy questions. This is the issue tracker for its conda-forge package.
Solution to issue cannot be found in the documentation.
Issue
Hello, I use the following simple code to test the performances of Cupy and Numpy (Using NVIDIA GeForce 2080Ti GPU):
and the Output result is:
It can be clearly seen that in processing this array, numpy is significantly faster than cupy, can anyone give an explanation to this? How can I optimize Cupy to achieve better performances?
Installed packages
Environment info