I noticed in the model card for zephyr-7b-beta that you mentioned "removing the in-built alignment of these datasets boosted performance on MT Bench and made the model more helpful," resulting in a filtered 200k UltraChat version. Could you please elaborate on the criteria used for this filtering? Would it be possible to open-source the corresponding script? I'm wondering if this could assist me in cleaning and filtering my own SFT data.
I noticed in the model card for zephyr-7b-beta that you mentioned "removing the in-built alignment of these datasets boosted performance on MT Bench and made the model more helpful," resulting in a filtered 200k UltraChat version. Could you please elaborate on the criteria used for this filtering? Would it be possible to open-source the corresponding script? I'm wondering if this could assist me in cleaning and filtering my own SFT data.