microsoft / tutel

Tutel MoE: An Optimized Mixture-of-Experts Implementation
MIT License
694 stars 84 forks source link

add device initialization for ops on non-default devices #223

Closed ghostplant closed 7 months ago