Closed gwarmstrong closed 1 week ago
@Kipok the gpu test failure appears to have come from an unrelated test--any idea what that's about?
The tests are failing because of an issue introduced in another PR. Let me run the training tests locally to double check there are no issues there and we can merge after that
Augments the training pipeline to support Reward Model Training with NeMo-Aligner