Hello-SimpleAI / chatgpt-comparison-detection

Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥
https://arxiv.org/abs/2301.07597

Do we have data splits? #19

Open sairights opened 1 year ago

sairights commented 1 year ago

Hi dear developers,

I am wondering whether data splits (e.g. train/val/test) have been released. I saw an issue from 3 weeks ago with an official reply saying "We will release the train-test split later." However, when I inspected the data, it did not appear to have been split yet. Please let me know if the official data splits have been released. Thanks!

izhx commented 1 year ago

Sorry, just released! https://github.com/Hello-SimpleAI/chatgpt-comparison-detection/blob/main/HC3/README.md

bigheiniu commented 1 year ago

Hi, for Table 7, does the training dataset include all domains, or is it a leave-one-out domain adaptation setting?