Closed jordane95 closed 3 months ago
Will the model fine-tuning code also be released soon? Thanks.
Sorry. Currently, we cannot release the training code. But it's not difficult to implement. Basically, only two modifications are required.
Another hint, we use trl to support value model training.
Will the model fine-tuning code also be released soon? Thanks.