The following error occurred while running cell 10 in section "6. Tune language model using PPO with our preference model".
After adding __init__.py to /content/trlx/examples/summarize_rlhf/reward_model/, I still get the same error.
How can I fix it?
10 import torch
11 from datasets import load_dataset
---> 12 from reward_model.reward_model import GPTRewardModel
13 from tqdm import tqdm
14 from transformers import AutoTokenizer
ModuleNotFoundError: No module named 'reward_model.reward_model'; 'reward_model' is not a package
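
A possible way to narrow this down (a diagnostic sketch, not a confirmed fix): the "'reward_model' is not a package" part of the message generally means Python resolved the name reward_model to a plain module, for example a reward_model.py file in the current working directory, rather than to the reward_model/ directory. In that case adding __init__.py would not change anything, because the directory is never reached. The snippet below reports what the name resolves to. describe_import and EXAMPLE_DIR are hypothetical names introduced here, and the assumption that the import should be resolved from /content/trlx/examples/summarize_rlhf (the parent of the directory named in the question) is mine.

```python
import importlib.util
import sys


def describe_import(name):
    """Report where a top-level module name resolves and whether it is a package."""
    spec = importlib.util.find_spec(name)
    if spec is None:
        return {"found": False, "origin": None, "is_package": False}
    return {
        "found": True,
        "origin": spec.origin,
        # Packages have submodule search locations; plain modules do not.
        "is_package": spec.submodule_search_locations is not None,
    }


# Assumption: the parent of the reward_model/ directory from the question.
# Adjust this if the repository was cloned somewhere else.
EXAMPLE_DIR = "/content/trlx/examples/summarize_rlhf"

# Put the parent directory first so the reward_model/ directory is found
# before any reward_model.py sitting in the current working directory.
if EXAMPLE_DIR not in sys.path:
    sys.path.insert(0, EXAMPLE_DIR)

print(describe_import("reward_model"))
```

If the printed origin points at a reward_model.py file and is_package is False, the notebook is probably running from inside the reward_model/ directory. Changing the working directory to the parent and restarting the runtime, so that any cached entry in sys.modules is cleared, may be enough for the import in cell 10 to succeed.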