Open natolambert opened 5 months ago
With the human data AI2 has or a dataset like no_robots, we could test if a RM prefers the human or model answers to a completion.
no_robots
Update: should use an open weights model for completions to private prompts, otherwise company with API has access to closed test set prompts.
With the human data AI2 has or a dataset like
no_robots
, we could test if a RM prefers the human or model answers to a completion.