ProjectSidewalk / sidewalk-quality-analysis

An analysis of Project Sidewalk user quality based on interaction logs

Will want to start building an end-to-end ML pipeline #15

Closed jonfroehlich closed 5 years ago

jonfroehlich commented 5 years ago

We will want to start building an end-to-end ML pipeline for predicting user quality, including:

Here's a simple example of an end-to-end ML pipeline for 3D gesture classification using an accelerometer, which I built for my PhD course. It is a code skeleton, so it does not use the best input features, but it does show how to create a full end-to-end classifier with scikit-learn:

To start, I think we can just do an 80/20 train-to-test split of the data. We will be getting more users and interaction logs as the project continues (and we get more validations).
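As a rough illustration of the 80/20 split, here is a minimal scikit-learn sketch. It is not the project's actual pipeline: the feature matrix is synthetic, and the three feature columns, the binary "good quality" label, and the choice of logistic regression are all assumptions made only to keep the example runnable.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data (assumption): one row per user, three
# hypothetical interaction-log features, and a binary quality label.
rng = np.random.default_rng(0)
n_users = 200
X = rng.normal(size=(n_users, 3))
y = (X[:, 0] + 0.5 * rng.normal(size=n_users) > 0).astype(int)

# 80/20 train-to-test split, stratified so both sets keep a similar
# ratio of the two labels.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Scale the features, then fit a simple classifier (a placeholder choice).
model = make_pipeline(StandardScaler(), LogisticRegression())
model.fit(X_train, y_train)

# Evaluate only on the held-out 20% of users.
y_pred = model.predict(X_test)
test_accuracy = accuracy_score(y_test, y_pred)
print(f"train users: {len(X_train)}, test users: {len(X_test)}")
print(f"test accuracy: {test_accuracy:.2f}")
```

Wrapping the scaler and classifier in one pipeline means the scaling statistics are learned from the training users only, so nothing from the test set leaks into training.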

See also these lectures:

daotyl000 commented 5 years ago

Neil & I have both created end-to-end ML pipelines in our notebook that we have been using.