lucasdelimanogueira / PyNorch

Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)
117 stars 7 forks source link

sum axis cuda version #63

Closed lucasdelimanogueira closed 6 months ago