Open TechxGenus opened 3 months ago
Thanks for opening this PR up! Admittedly, we don't have much CI testing for PyTorch on the 4090. Would you mind trying to create a minimal repro and posting it on the PyTorch repo? Feel free to tag me in the issue.
Thanks. I created it here: https://github.com/pytorch/pytorch/issues/133254
Thanks for sharing this great resource. I'm trying to run some benchmarks with `test_mask` from `examples/flex_attn.ipynb` on one RTX 4090. When I set `B=1, H=16, S=2048, D=128`, it triggers an error: