huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences
https://huggingface.co/HuggingFaceH4
Apache License 2.0
4.6k stars 401 forks source link

Questions about data filtering for zephyr-7b-beta's UltraChat version #7

Open jc-ryan opened 11 months ago

jc-ryan commented 11 months ago

I noticed in the model card for zephyr-7b-beta that you mentioned "removing the in-built alignment of these datasets boosted performance on MT Bench and made the model more helpful," resulting in a filtered 200k UltraChat version. Could you please elaborate on the criteria used for this filtering? Would it be possible to open-source the corresponding script? I'm wondering if this could assist me in cleaning and filtering my own SFT data.

ananddw24 commented 11 months ago

This will be a really helpful script in helping asset. Looking forward to the release..!!