shawntan / scattermoe

Triton-based implementation of Sparse Mixture of Experts.
Apache License 2.0
186 stars 14 forks source link

Megablocks example #7

Open ehartford opened 7 months ago

ehartford commented 7 months ago

Can you please provide an example to use ScatterMoE with Megablocks? https://github.com/databricks/megablocks