NVIDIA / NeMo-Aligner

Scalable toolkit for efficient model alignment
Apache License 2.0
419 stars 44 forks source link

Ensure critic server does not squeeze out a singleton batch dim #198

Closed terrykong closed 3 weeks ago

terrykong commented 3 weeks ago

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Changelog

Usage

# Add a code snippet demonstrating how to use this 

Before your PR is "Ready for review"

Pre checks:

Checklist when contributing a new algorithm

Additional Information