Hi Team,
we would like to drop in our reward model and would like to how the following values were calculated and added to config ? was this calculated by running reward inference on entire train reward dataset ?
if hasattr(config, "mean"):
self.mean[0] = config.mean
self.std[0] = config.std
Hi Team, we would like to drop in our reward model and would like to how the following values were calculated and added to config ? was this calculated by running reward inference on entire train reward dataset ?