ProjectSidewalk / sidewalk-quality-analysis

An analysis of Project Sidewalk user quality based on interaction logs
5 stars 3 forks source link

Restarting this with a fresh, clean list of input features for user classification #54

Open jonfroehlich opened 4 years ago

jonfroehlich commented 4 years ago

Columns I want to start out with:

Some other things I'm thinking about but don't need right away

jonfroehlich commented 4 years ago

Ongoing list of features and their descriptions here: https://github.com/ProjectSidewalk/sidewalk-quality-analysis/blob/master/data/ml-codebook.csv

jonfroehlich commented 2 years ago

@misaugstad, just coming back to this now after a while. Some things I'd still like to check out:

misaugstad commented 2 years ago

In a few minutes I'll be committing new CSVs including the column n_label_with_description, n_curb_ramp_with_description, n_missing_curb_ramp_with_description, n_obstacle_with_description, n_surface_problem_with_description, and n_no_sidewalk_with_description. I was already going to upload new CSVs without the minimum validations requirement, and this was an easy set of columns to add.

Time per pano is more complicated to add, so I'm thinking that I should be spending that time on incorporating Dutch translations in Sidewalk. lmk if this should be higher priority at any point.

jonfroehlich commented 2 years ago

Thank you. Agree. I think interactions per pano will be a reasonable proxy for time per pano anyway.