Open ericxsun opened 7 months ago
Yes it should work out of the box. Use MDSWrite to convert your preference data to MDS , and create a streaming dataset out of it. Use json as the encoding method. Let me know if you see any issue.
Yes it should work out of the box. Use MDSWrite to convert your preference data to MDS , and create a streaming dataset out of it. Use json as the encoding method. Let me know if you see any issue.
Thank you for quickly explain, I'll try it.
@ericxsun Wondering, have you tried the @XiaohanZhangCMU suggestion? Did it work?
Hi @ericxsun want to follow up here before closing this issue.
🚀 Feature Request
The preference data looks like this:
This data is used to train a Reward Model or DPO
I'm wondering if it's possible to use streaming for this kind of situation. And How? Thanks very much.