Closed gshennvm closed 2 weeks ago
fix issue when reward model is trained with 24.01 but loaded with 24.05
fix issue when reward model is trained with 24.01 but loaded with 24.05