lucasdelimanogueira / PyNorch

Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)
117 stars 7 forks source link

broadcasted batched matmul cuda #62

Closed lucasdelimanogueira closed 6 months ago