-
The problem: Can't install giotto-tda, giotto-ph, giotto-time etc on aarch64 (NVIDIA Jetson) architectures. I get the error:
ERROR: Could not build wheels for giotto-tda.
The reason: Allowing …
-
Hi,
Warp reduce functions are available in CUDA (cf. https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#warp-reduce-functions) but not in HIP.
There is equivalent functionality in …
Epliz updated
6 months ago
-
I am trying to compile the following code based on a raja vector sum example.
```
#include
#include "chai/ManagedArray.hpp"
#include "RAJA/RAJA.hpp"
int main(int argc, char* argv[]) {
us…
-
CUDA (Compute Unified Device Architecture) is a parallel computing platform and application programming interface (API) model created by NVIDIA. It allows developers to use GPUs (Graphics Processing U…
-
Part of #4290
CuPy JIT needs more coverage of the functions and attributes supported in CUDA.
Each function can be implemented in CuPy JIT by writing a class that inherits from `BuiltinFunc` [her…
-
Hello, thanks for the great work.
Speed-up between CPU and GPU is obviously great but what about speed difference between CUDA and Bend for the same algorithm? A comparison using a tensor operation…
-
When discussing the Thrust JIT support with the CCCL team, a question was raised regarding the usage of the `jit.thrust.device` policy in the test suite, ex:
https://github.com/cupy/cupy/blob/be5d7f…
-
Here's some good news straight from the mouth of NVIDIA
http://docs.nvidia.com/cuda/cuda-c-programming-guide/#shared-memory
-
Exciting work, I am very interested, but since my coding ability is weak, can you provide a CUDA code about DCA, it will be greatly appreciated
nanmi updated
2 months ago
-
Hi,
I'm curious about what happen if we take another step for this awesome algorithm and add GPU support to it? does the performance may be added dramatically ?