ahans30 / Binoculars

[ICML 2024] Binoculars: Zero-Shot Detection of LLM-Generated Text
https://arxiv.org/abs/2401.12070
BSD 3-Clause "New" or "Revised" License
193 stars 28 forks source link

Link to Evaluation Datasets #2

Closed dadamson closed 7 months ago

dadamson commented 8 months ago

While the Ghostbuster and Open Orca datasets are available, it would be great to be able replicate your results, or use them as a benchmark, with the Falcon/Llama examples you generated from CCNews, PubMed, and CNN data. Please share this data if you can!

ahans30 commented 8 months ago

Thanks for raising this. I will be putting up our datasets shortly. :)

lilakk commented 8 months ago

Yes it would be really helpful if you could release the outputs!

ahans30 commented 7 months ago

Hi, sorry for the delay. I have pushed the datasets. You can find them Binoculars /datasets/.