chufanchen / read-paper-and-code

0 stars 0 forks source link

DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers #115

Open chufanchen opened 6 months ago

chufanchen commented 6 months ago

https://arxiv.org/abs/2403.10266