feifeibear / long-context-attention

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
Apache License 2.0
351 stars, 24 forks

add torch profiler #41

Closed: feifeibear closed this 6 months ago