zhanwenchen / relaug

MIT License

Poor Results for IMP and Transformer #81

Closed zhanwenchen closed 1 year ago

zhanwenchen commented 1 year ago

Diffs:

  1. DTYPE: float16 => DTYPE: float32
  2. BASE_LR: 1e-3 => BASE_LR: 1e-4
  3. SOLVER.SCHEDULE.TYPE: WarmupReduceLROnPlateau => SOLVER.SCHEDULE.TYPE: WarmupMultiStepLR
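The three diffs above can be sketched as dotted-key overrides applied to a nested config. This is only an illustration in plain Python, not the repository's actual config code: the `BASELINE` dict, the `apply_overrides` and `diff_configs` helpers, and the placement of `BASE_LR` under `SOLVER` are all assumptions made for the example.

```python
import copy

# Hypothetical baseline mirroring the "before" side of the three diffs.
# Assumption: BASE_LR lives under SOLVER; the issue does not qualify the key.
BASELINE = {
    "DTYPE": "float16",
    "SOLVER": {
        "BASE_LR": 1e-3,
        "SCHEDULE": {"TYPE": "WarmupReduceLROnPlateau"},
    },
}

# The "after" side of the diffs, keyed by dotted path.
OVERRIDES = {
    "DTYPE": "float32",
    "SOLVER.BASE_LR": 1e-4,
    "SOLVER.SCHEDULE.TYPE": "WarmupMultiStepLR",
}


def apply_overrides(config, overrides):
    """Return a copy of config with each dotted-key override applied."""
    updated = copy.deepcopy(config)
    for dotted_key, value in overrides.items():
        *parents, leaf = dotted_key.split(".")
        node = updated
        for name in parents:
            node = node[name]  # raises KeyError if the path is missing
        if leaf not in node:
            raise KeyError(dotted_key)  # refuse to create unknown keys
        node[leaf] = value
    return updated


def diff_configs(old, new, prefix=""):
    """List 'PATH: old => new' strings for every leaf value that changed."""
    changes = []
    for key, old_value in old.items():
        path = f"{prefix}{key}"
        if isinstance(old_value, dict):
            changes.extend(diff_configs(old_value, new[key], path + "."))
        elif old_value != new[key]:
            changes.append(f"{path}: {old_value} => {new[key]}")
    return changes


UPDATED = apply_overrides(BASELINE, OVERRIDES)
for line in diff_configs(BASELINE, UPDATED):
    print(line)
```

Rejecting unknown keys keeps a mistyped override from silently leaving the run on its old setting, which matters when comparing results across configs.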

zhanwenchen commented 1 year ago

Still only 13.93 with these changes. Maybe match the model_transformer implementation? Closing this ticket to continue the work in #115 instead.