FMInference / DejaVu

268 stars 32 forks source link

Training of predictors based on MOE model #31

Open mailonghua opened 1 month ago

mailonghua commented 1 month ago

Hello everyone, I want to train the FFN low-rank predictor for the MOE model,so how should i choose the point to collect training data? Switch Transformers