Open fanminshi opened 2 months ago
Question
I ran the step 3 MoE fine-tuning on the phi-2 model, and the loss doesn't seem to drop much. Is that normal? Thanks!