tencent-ailab / persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
766 stars 55 forks source link

Code for Deduplication #8

Open qtli opened 3 weeks ago

qtli commented 3 weeks ago

Hi, thanks so much for your promising work! I was hoping to inquire if it's possible for you to provide me with the code for the "Deduplication" section. Thank you in advance for your help!