About ema_mode - GitHub Issues

Hi @hsannn, sorry for the late reply. We used EMA only on the IMDB dataset because performance on IMDB varied significantly across runs, and EMA often gives more stable results. However, even with EMA, the variance across runs on IMDB is still quite high compared with the variance on the other datasets without EMA. We didn't use EMA on the other datasets because their variance across runs is already small without it. You are welcome to try EMA on the other datasets as well, which might further improve performance. Hope this helps!
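For readers unfamiliar with the idea, here is a minimal sketch of what EMA of model weights does in general. It is not the JointMatch implementation: the class name `ParamEMA`, the `decay` value, and the toy scalar parameter `w` are all assumptions made for illustration. The point is only that the averaged ("shadow") weights fluctuate less than the raw weights, which is why EMA tends to give more stable results.

```python
from typing import Dict


class ParamEMA:
    """Hypothetical helper: exponential moving average of named scalar parameters."""

    def __init__(self, params: Dict[str, float], decay: float = 0.99) -> None:
        if not 0.0 <= decay < 1.0:
            raise ValueError("decay must be in [0, 1)")
        self.decay = decay
        # Shadow weights start as a copy of the current parameters.
        self.shadow = dict(params)

    def update(self, params: Dict[str, float]) -> None:
        # shadow <- decay * shadow + (1 - decay) * current
        for name, value in params.items():
            self.shadow[name] = (
                self.decay * self.shadow[name] + (1.0 - self.decay) * value
            )


# Toy usage: a noisy parameter that oscillates around 1.0 (illustrative only).
params = {"w": 1.0}
ema = ParamEMA(params, decay=0.9)
for step in range(100):
    params["w"] = 1.0 + (0.5 if step % 2 == 0 else -0.5)
    ema.update(params)

# The raw weight ends 0.5 away from 1.0; the EMA weight stays much closer.
print(params["w"], ema.shadow["w"])
```

In a real training loop the same update would typically be applied to every model tensor after each optimizer step, and evaluation would use the shadow weights instead of the raw ones.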

HenryPengZou / JointMatch

About ema_mode #2