rbroc / echo

A Scalable and Explainable Approach to Discriminating Between Human and Artificially Generated Text
https://cc.au.dk/en/clai/current-projects/a-scalable-and-explainable-approach-to-discriminating-between-human-and-artificially-generated-text
2 stars 1 forks source link

Finish generations of data (for now!) #49

Closed MinaAlmasi closed 6 months ago

MinaAlmasi commented 6 months ago

Generations Overview

Data has been generated for four datasets (stories, dailydialog, dailymail_cnn, mrpc) across four models beluga7b, mistral7b, llama2_chat7b and llama2_chat13b across three . Stories was generated with two diff. prompts!

Costs were not high, so we can definitely generate more data if needed at some point (or across more temperatures).

Some plotting (of length distributions) was also done!

Steps from here