The README.md files in the model and the model/model_training folder include the following command python trainer_rm.py --configs defaults_rm oasst-rm-1-pythia-1b to train the reward model.
Running this command returns a KeyError: 'oasst-rm-1-pythia-1b' since the model/model_training/configs/config_rm.yaml file only has the following model options
The README.md files in the model and the model/model_training folder include the following command
python trainer_rm.py --configs defaults_rm oasst-rm-1-pythia-1b
to train the reward model. Running this command returns aKeyError: 'oasst-rm-1-pythia-1b'
since the model/model_training/configs/config_rm.yaml file only has the following model options