openpsi-project / ReaLHF

Super-Efficient RLHF Training of LLMs with Parameter Reallocation
Apache License 2.0
114 stars, 4 forks

[Patch Fixes] Fix several bugs and unintended behaviors. #55

Closed: garrett4wade closed this 2 months ago

garrett4wade commented 2 months ago

Patch Fixes:

Bug Fixes:


Changes after review

garrett4wade commented 2 months ago

@nuzant The review is not urgent. Take your time.