How to compute AUROC using the Binocular scores?

ahans30 / Binoculars

[ICML 2024] Binoculars: Zero-Shot Detection of LLM-Generated Text

BSD 3-Clause "New" or "Revised" License


Hi, thanks for your interest in the project. Below is a code snippet you can use to compute the AUC with the standard pandas and sklearn libraries.

from sklearn import metrics

# df is a pandas DataFrame with a `binoculars_score` column and a ground-truth
# `sample_class` column (1 for AI-generated, 0 for human-written text)
score = "binoculars_score"
label = "sample_class"

# Lower Binoculars scores indicate AI-generated text, so negate the scores to make
# higher values correspond to the positive class. This modifies df in place: run it only once.
df[score] *= -1
fpr, tpr, _ = metrics.roc_curve(y_true=df[label], y_score=df[score], pos_label=1)
roc_auc = metrics.auc(fpr, tpr)
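
For a quick sanity check, here is a minimal self-contained sketch of the same computation on a toy DataFrame. The helper name `binoculars_auroc` and the six scores are made up for illustration and are not real Binoculars outputs. It negates a copy of the scores instead of modifying `df`, and cross-checks the result against `metrics.roc_auc_score`.

```python
import pandas as pd
from sklearn import metrics


def binoculars_auroc(df, score="binoculars_score", label="sample_class"):
    """Return the AUROC, treating label 1 (AI-generated) as the positive class.

    Hypothetical helper for illustration. The scores are negated on the fly,
    so the caller's DataFrame is left unchanged.
    """
    y_score = -df[score]  # higher value now means "more likely AI-generated"
    fpr, tpr, _ = metrics.roc_curve(y_true=df[label], y_score=y_score, pos_label=1)
    return metrics.auc(fpr, tpr)


# Toy data (invented values): three AI-generated and three human-written samples.
toy = pd.DataFrame(
    {
        "binoculars_score": [0.70, 0.75, 0.98, 0.95, 1.00, 1.05],
        "sample_class": [1, 1, 1, 0, 0, 0],
    }
)

auroc = binoculars_auroc(toy)

# roc_auc_score computes the same quantity in a single call.
reference = metrics.roc_auc_score(toy["sample_class"], -toy["binoculars_score"])
assert abs(auroc - reference) < 1e-12

print(f"AUROC: {auroc:.4f}")
```

Because the helper leaves the DataFrame untouched, it is safe to call repeatedly, for example once per dataset or per model pair.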

Hopefully this helps. I'm closing this issue, but please feel free to comment if you run into any further issues.

How to compute AUROC using the Binocular scores? #4