Closed zhanwenchen closed 1 year ago
Diffs:
DTYPE: float16
=> DTYPE: float32
BASE_LR: 1e-3
=> BASE_LR: 1e-4
SOLVER.SCHEDULE.TYPE: WarmupReduceLROnPlateau
=> SOLVER.SCHEDULE.TYPE: WarmupMultiStepLR
Still only 13.93. Maybe match the model_transformer implementation? Close this ticket and work in #115 instead.