Closed gshennvm closed 2 months ago
add some much needed cleanup to the critic and reward model inference servers.
rest of the changes are in the changelog
thanks for the review! I checked the numerics locally against the previous main and ran a nemo generate only test. They both look good to my eye so I'm merging now.
add some much needed cleanup to the critic and reward model inference servers.
rest of the changes are in the changelog