This PR updates the Check for Duplicate Marks script to v3.0, fixing an issue where the scripts was expecting an incorrect CSV format.
The earlier version of the script assumes the Classifications Exports CSV received (from a Zooniverse.org's Project Builder page's Data Exports) is of the format: "id";"project_id";"user_id";"workflow_id";"annotations";"created_at";"updated_at";"user_group_id";"user_ip";"completed";"gold_standard";"expert_classifier";"metadata";"workflow_version";"lifecycled_at"
In actuality, the format is classification_id,user_name,user_id,user_ip,workflow_id,workflow_name,workflow_version,created_at,gold_standard,expert,metadata,annotations,subject_data,subject_ids
PR Overview
Original report: https://zooniverse.freshdesk.com/a/tickets/3291 (in response to the email sent out by @mrniaboc informing users about the duplicate marks bug)
This PR updates the Check for Duplicate Marks script to v3.0, fixing an issue where the scripts was expecting an incorrect CSV format.
"id";"project_id";"user_id";"workflow_id";"annotations";"created_at";"updated_at";"user_group_id";"user_ip";"completed";"gold_standard";"expert_classifier";"metadata";"workflow_version";"lifecycled_at"
classification_id,user_name,user_id,user_ip,workflow_id,workflow_name,workflow_version,created_at,gold_standard,expert,metadata,annotations,subject_data,subject_ids
This script has now been corrected.
Status
Quick fix, merging.