srush / Triton-Puzzles

Puzzles for learning Triton
Apache License 2.0
925 stars 57 forks source link

question about long softmax #7

Open hgl71964 opened 4 months ago

hgl71964 commented 4 months ago

I solve the long softmax puzzels, but I have to store the intermediate results to z_ptr, which may cause unnecessary Memory I/O.

Essentially, I would like to know if there's a solution to create temporary array in shared memory and store intermediate results there in Triton?

srush commented 4 months ago

There is 😃