IBM / SALMON

Self-Alignment with Principle-Following Reward Models
https://arxiv.org/abs/2310.05910
GNU General Public License v3.0
144 stars 13 forks source link

Dataset: upload preference dataset #2

Open Dada-Cloudzxy opened 8 months ago

Dada-Cloudzxy commented 8 months ago

could you upload your all preference dataset so that we can redo it quickly?