microsoft / Tutel

Tutel MoE: An Optimized Mixture-of-Experts Implementation
MIT License
724 stars 93 forks source link

add device initialization for ops on non-default devices #223

Closed ghostplant closed 9 months ago