lucasdelimanogueira / PyNorch

Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)
117 stars 7 forks source link

fix sum cuda axis atomicAdd #67

Closed lucasdelimanogueira closed 6 months ago