GAIR-NLP / ProX

Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"
https://gair-nlp.github.io/ProX/
Apache License 2.0
195 stars 15 forks source link

where are codes about refine data ? #4

Closed peiji1981 closed 2 months ago

peiji1981 commented 2 months ago

where are codes about refine data ? I can only see the similar codes with tinyllama project

koalazf99 commented 2 months ago

Thank you for your interest in ProX! Please check #2 first; we will release the refining framework code later.