Closed bkowshik closed 7 years ago
With https://github.com/mapbox/gabbar/issues/88, we have an anomaly detection model for highways that is in a way not limited by the labelled dataset of highways. 🎉 The anomaly detection prototype is something we will iterate on for a couple of weeks. Let's re-open this issue when new ideas come along.
With the current supervised-learning classifier, we train the model on changesets labelled 👍 and 👎. Soon, the classifier will start predicting on new changesets based on what it learned from that labelled dataset. But we have not manually 👀 all types of harmful changesets, and maybe we never will, as new kinds of problematic edits keep coming along.
So, ideally, in the future we will need some kind of unsupervised classifier that is not limited to the labelled subset of the dataset and can instead make use of every changeset that comes along on OpenStreetMap.
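To make the idea concrete, here is a minimal sketch of label-free scoring. It is not the model from the linked issue. It assumes each changeset has already been reduced to a numeric feature row; the two features used here (`nodes_modified`, `ways_deleted`), the sample values, and the `3.5` cutoff are all hypothetical. The score is a robust z-score built from the per-feature median and median absolute deviation (MAD), so it needs no 👍 / 👎 labels.

```python
from statistics import median

# Hypothetical feature rows: (nodes_modified, ways_deleted) per changeset.
# No labels are involved; every changeset seen so far can be included.
history = [(12, 0), (8, 1), (15, 0), (10, 2), (9, 1), (11, 0), (14, 1)]


def fit_robust_stats(rows):
    """Return a (median, MAD) pair for each feature column."""
    stats = []
    for column in zip(*rows):
        med = median(column)
        mad = median(abs(value - med) for value in column)
        stats.append((med, mad))
    return stats


def anomaly_score(row, stats, eps=1e-9):
    """Largest robust z-score across features; higher means more unusual.

    1.4826 rescales MAD so it is comparable to a standard deviation
    for normally distributed data. eps avoids division by zero when a
    feature has MAD == 0 (any deviation then scores very high).
    """
    return max(
        abs(value - med) / (1.4826 * mad + eps)
        for value, (med, mad) in zip(row, stats)
    )


stats = fit_robust_stats(history)
THRESHOLD = 3.5  # assumed cutoff, would need tuning on real changesets

for candidate in [(10, 1), (400, 35)]:
    flagged = anomaly_score(candidate, stats) > THRESHOLD
    print(candidate, "flag for review" if flagged else "looks typical")
```

Because the statistics are recomputed from whatever changesets have been observed, the baseline can follow the data as new kinds of edits appear, rather than being fixed by what was labelled.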
cc: @anandthakker @batpad @geohacker