ProjectSidewalk / sidewalk-quality-analysis

An analysis of Project Sidewalk user quality based on interaction logs
5 stars 3 forks source link

Likelihood that a label is correct vs. worker quality inference #14

Open jonfroehlich opened 5 years ago

jonfroehlich commented 5 years ago

I've been thinking about two highly related (but still different tasks):

  1. Creating a model that estimates the likelihood of correctness for any given label based on, for example, it's location on a street segment, it's x,y,z location on the labeling canvas, other labels on proximal streets, and possibly even external factors (e.g., census tract data, etc.)

  2. Creating a model that estimates the quality of a crowd worker--some of which may take advantage of the first point but other features may be independent.