Open Mrinh212375 opened 1 week ago
Hi, it looks like the error arises because the PPOTrainer class expects a value_model to be defined and passed in, which appears to be required in the current TRL version. The disable_dropout_in_model method is likely encountering NoneType because value_model wasn't specified and thus defaults to None.
Hope this helps!
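To illustrate the failure mode described above, here is a minimal, library-free sketch. TinyModule, disable_dropout, and require_models are hypothetical stand-ins written for this example, not TRL code; they only assume that the dropout helper walks model.modules(), which is why a None model fails with an AttributeError on NoneType.

```python
class TinyModule:
    """Hypothetical stand-in for torch.nn.Module, just enough for this demo."""

    def __init__(self, p=None, children=()):
        self.p = p  # dropout probability, or None if this is not a dropout layer
        self.children = list(children)

    def modules(self):
        # Yield this module and every nested submodule, like nn.Module.modules().
        yield self
        for child in self.children:
            yield from child.modules()


def disable_dropout(model):
    """Zero every dropout probability; fails on None, which has no .modules()."""
    for module in model.modules():
        if module.p is not None:
            module.p = 0.0


def require_models(**models):
    """Raise a readable error naming every model argument that is None."""
    missing = sorted(name for name, model in models.items() if model is None)
    if missing:
        raise ValueError("missing required model(s): " + ", ".join(missing))
```

Calling something like require_models(policy=policy, value_model=value_model) before building the trainer would turn the opaque NoneType error into a message that names the missing argument.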
Hi, thanks. I passed value_model the same model as policy_model. I thought it was optional, so I hadn't passed anything. Anyway, the error is gone. Also, I can call the ppo_trainer.train() method directly, right? Unlike the older version, there is no need to write a PPO training loop. Can you please clarify this point?
Glad to hear the error is resolved!
Yes, as far as I know, you can directly call the ppo_trainer.train() method without needing to write a training loop.
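For reference, a rough sketch of what that looks like, based on my reading of the v0.12.1 PPOTrainer signature (config, processing_class, policy, ref_policy, reward_model, train_dataset, value_model). These keyword names are an assumption tied to that version and were renamed in later releases, so check your installed TRL. The function name run_ppo and its parameters are hypothetical, and train_dataset is assumed to be already tokenized with an input_ids column. Imports are deferred into the function so that it can be defined without the heavy dependencies installed.

```python
def run_ppo(policy_name, reward_name, train_dataset, output_dir="ppo-out"):
    """Build a PPOTrainer with all four models and call train() directly."""
    # Deferred imports: transformers and trl are only needed when this runs.
    from transformers import (
        AutoModelForCausalLM,
        AutoModelForSequenceClassification,
        AutoTokenizer,
    )
    from trl import PPOConfig, PPOTrainer

    tokenizer = AutoTokenizer.from_pretrained(policy_name, padding_side="left")
    if tokenizer.pad_token is None:
        # Assumption: reusing EOS as the pad token is acceptable for this model.
        tokenizer.pad_token = tokenizer.eos_token

    policy = AutoModelForCausalLM.from_pretrained(policy_name)
    ref_policy = AutoModelForCausalLM.from_pretrained(policy_name)
    # Reward and value models as single-output sequence classifiers.
    reward_model = AutoModelForSequenceClassification.from_pretrained(
        reward_name, num_labels=1
    )
    value_model = AutoModelForSequenceClassification.from_pretrained(
        reward_name, num_labels=1
    )

    trainer = PPOTrainer(
        config=PPOConfig(output_dir=output_dir),
        processing_class=tokenizer,
        policy=policy,
        ref_policy=ref_policy,
        reward_model=reward_model,
        value_model=value_model,  # passed explicitly rather than left as None
        train_dataset=train_dataset,
    )
    trainer.train()  # no hand-written PPO loop
    return trainer
```

This sketch loads a separate sequence-classification model as the value model instead of reusing the policy; whether reusing the policy works in your setup is something to verify against your TRL version.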
System Info
Kaggle Notebook
Information
Tasks
examples folder
Reproduction
outputs:
Expected behavior
https://github.com/huggingface/trl/blob/v0.12.1/trl/trainer/ppo_trainer.py#L91 In the PPOTrainer class, how do I pass nn.Module objects if I'm working with a HF PreTrainedModelWrapper model? Is there any way to extract the nn.Module from the PreTrainedModelWrapper?
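One possible approach, sketched under the assumption that the wrapper stores the underlying model in a pretrained_model attribute (as TRL's value-head wrappers do, to my knowledge). The helper name unwrap_pretrained is hypothetical, and whether the unwrapped model is accepted by PPOTrainer in your version is not verified here.

```python
def unwrap_pretrained(model):
    """Return model.pretrained_model if present, otherwise the model itself."""
    # Assumption: wrappers expose the wrapped model as `pretrained_model`.
    return getattr(model, "pretrained_model", model)
```

Falling back to the model itself means the same call is safe for both wrapped and plain models.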
Checklist