zooniverse / AnnoTate

Full text transcription app for the Tate Britain
http://anno.tate.org.uk
Other
12 stars 2 forks source link

Line counts v page counts_possible change to how we assess user contribution #185

Open VVH opened 9 years ago

VVH commented 9 years ago

At the moment we're counting pages as the unit of classification, but we've badged this project as a line-by-line process and that every line counts. If we could give a little more info, e.g. tell people their line count based on @ggdhines' aggregation engine, and then the page count as normal, that would be ideal. Obviously this is not a make or break fix, but if we can make the change it would be good to make it sooner rather than later.

Slack conversation:

Cam Allen [10:49 AM] @vvh: we only count raw classification submissions (so pages) Cam Allen [10:49 AM] if you want a more granular count we'll have to do it via the aggregation engine vvh [10:49 AM] hm, is there any way to change that in future? Cam Allen [10:49 AM] nope vvh [10:50 AM] so we couldn’t feed in a count from the aggregation engine? Cam Allen [10:50 AM] yes we could Cam Allen [10:50 AM] but a classification submission is our base count Cam Allen [10:50 AM] unless you want to submit 1 classification per line vvh [10:50 AM] actually, that is what I want vvh [10:51 AM] the reason being that we’re saying to people ‘this is the unit of classification' vvh [10:51 AM] and the time it takes to do a line is about the time it takes to classify a galaxy or something on Snapshot.

VVH commented 8 years ago

I'd love to see this in Shakespeare's World and on any future text project on PFE. @camallen suggests this is easy enough to implement using annotation counts.

camallen commented 8 years ago

@VVH my suggestion re 1 classification : line will not be backwards compatible, i.e. counts would be out of sync between the two submission types.

We could look at building something for annotation like projects to count individual annotations in each classification instead of raw classification counts but its not a high priority with other work coming up.

camallen commented 8 years ago

@ggdhines how easy would it be for the aggregation engine to create a count of all annotations a user has submitted for all their classifications?

ggdhines-zz commented 8 years ago

a bit of work but not impossible - this would only be relevant for Annotate/SW but the code to read in classifications is in the base aggregation class which is used to aggregate all projects. So some refactoring would be needed. Don't think it would affect the overall way the code works but there might be some issues with double counting that I would need to think about.

camallen commented 8 years ago

If you could have a look that would be appreciated. Yep only the annotation projects that i know off, we'd keep a count of classifications (current) and then another of their line / marking efforts (some may be multi line).