Double data entry for the sorting task

I got the following suggestion after giving a short talk on OpenOversight: one way to automatically flag malicious users, as well as catch honest errors, is to use double data entry. In double data entry, as the name suggests, each image is reviewed by two different people. If a particular user's classifications disagree with the other reviewer's at a very high rate, that user can be automatically flagged and removed from the system.

This issue covers enabling double data entry for the sorting task; we can potentially do the same for the tagging task at a later date. We need:

A way to track which images should be displayed to which users, so that each image is shown to two different people (instead of picking a random image that nobody has looked at yet, as we do now).
A workflow for flagging malicious users: we need to figure out what threshold is appropriate for flagging. To start, we should surface these flagged users to the administrators. An example threshold would be users who have classified more than 10 images and whose classification differs from the other reviewer's on more than 50% of them. Maybe admins can have a dashboard of some sort for monitoring this kind of information.
A way to resolve errors: even if a user is not being malicious, everyone makes mistakes. If an image gets two classifications that disagree, an admin can resolve the conflict (we could also do this programmatically by defining rules for what constitutes consensus, but let's not run before we can walk).
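
To make the three requirements above concrete, here is a rough sketch in plain Python, not tied to the actual OpenOversight models. Everything in it is an assumption for illustration: the `Classification` record, the label strings, the function names, and the decision to compute the disagreement rate only over images that already have a second review. The thresholds are the example values from this issue (more than 10 images, more than 50% disagreement).

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, Iterable, List, Optional


@dataclass(frozen=True)
class Classification:
    """Hypothetical record of one user's sorting decision for one image."""
    image_id: int
    user_id: int
    label: str  # hypothetical label values, e.g. "A" / "B"


REVIEWS_PER_IMAGE = 2    # double data entry: two different reviewers per image
MIN_COMPARED = 10        # example threshold: more than 10 images
MAX_DISAGREEMENT = 0.5   # example threshold: more than 50% disagreement


def _labels_by_image(classifications: Iterable[Classification]) -> Dict[int, Dict[int, str]]:
    """Group classifications as image_id -> {user_id: label}."""
    grouped: Dict[int, Dict[int, str]] = defaultdict(dict)
    for c in classifications:
        grouped[c.image_id][c.user_id] = c.label
    return grouped


def next_image_for(user_id: int, image_ids: Iterable[int],
                   classifications: Iterable[Classification]) -> Optional[int]:
    """Pick an image that still needs a review and that this user has not seen.

    Images that already have one review are preferred so pairs complete sooner.
    A real version would probably randomize among equally good candidates.
    """
    grouped = _labels_by_image(classifications)
    candidates = [
        i for i in image_ids
        if len(grouped.get(i, {})) < REVIEWS_PER_IMAGE and user_id not in grouped.get(i, {})
    ]
    if not candidates:
        return None
    return max(candidates, key=lambda i: len(grouped.get(i, {})))


def conflicting_images(classifications: Iterable[Classification]) -> List[int]:
    """Images whose reviewers disagree; these would go to an admin to resolve."""
    grouped = _labels_by_image(classifications)
    return sorted(
        image_id for image_id, by_user in grouped.items()
        if len(by_user) >= 2 and len(set(by_user.values())) > 1
    )


def users_to_flag(classifications: Iterable[Classification]) -> List[int]:
    """Users to surface to admins under the example threshold.

    Assumption: only images with a second review count toward either number.
    """
    compared: Dict[int, int] = defaultdict(int)
    disagreed: Dict[int, int] = defaultdict(int)
    for by_user in _labels_by_image(classifications).values():
        if len(by_user) < 2:
            continue  # no second opinion yet, nothing to compare
        for user, label in by_user.items():
            compared[user] += 1
            if any(other != label for u, other in by_user.items() if u != user):
                disagreed[user] += 1
    return sorted(
        u for u in compared
        if compared[u] > MIN_COMPARED and disagreed[u] / compared[u] > MAX_DISAGREEMENT
    )
```

One caveat with comparing pairs: a disagreement counts against both reviewers, so an honest user who happens to be paired mostly with a malicious one can also cross the threshold. That is one more reason to show flagged users to admins first instead of removing them automatically.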

lucyparsons / OpenOversight, issue #187