rbroc / echo

A Scalable and Explainable Approach to Discriminating Between Human and Artificially Generated Text
https://cc.au.dk/en/clai/current-projects/a-scalable-and-explainable-approach-to-discriminating-between-human-and-artificially-generated-text

DailyDialog: Regenerate dataset with correct lengths + extract new metrics for it #56

Closed: MinaAlmasi closed this issue 7 months ago

MinaAlmasi commented 7 months ago

Regenerating DailyDialog

DailyDialog was generated with incorrect min and max lengths and will therefore be regenerated, as discussed in the March meeting (#51). Because this has not been urgent, the focus so far has been on building the metrics extraction pipeline.

DailyDialog will be regenerated at some point before the classifiers are finalised. NB: Remember to extract metrics for the dataset again once it has been regenerated for all language models.