microsoft / Tutel

Tutel MoE: An Optimized Mixture-of-Experts Implementation
MIT License
724 stars 93 forks source link

enable message size larger than 4GB for all_to_all_v/all_gather_v #228

Closed ghostplant closed 7 months ago