IBM / SALMON

Self-Alignment with Principle-Following Reward Models
https://arxiv.org/abs/2310.05910
GNU General Public License v3.0
148 stars, 14 forks

Dataset: upload preference dataset #2

Open Dada-Cloudzxy opened 10 months ago

Dada-Cloudzxy commented 10 months ago

Could you upload your full preference dataset so that we can reproduce the work quickly?