mapbox / gabbar

Guarding OpenStreetMap from harmful edits using machine learning
MIT License
19 stars 7 forks source link

Convert binary attributes into rich numericals #46

Closed bkowshik closed 7 years ago

bkowshik commented 7 years ago

Ref https://github.com/mapbox/gabbar/issues/43


In https://osmcha.mapbox.com/47414802/, a place=village was converted to place=town.

screen shot 2017-05-30 at 12 56 22 pm

At present, the context we give the machine learning model about this modification is along with other attributes are:

But, the model has no knowledge on what the modification was to make an effective prediction on whether the feature modification was a 👍 or a 👎. So, how about we convert the binary value representing the modification into better numerical's to aid the model to make a more informed decision.

Popularity from TagInfo

TagInfo provides values for what percentage of place features have say a city as the value. Ex: 0.43% of all place objects on OpenStreetMap are place=city. With this, the model will currently get the following attributes:


cc: @batpad @geohacker

bkowshik commented 7 years ago

We now have percentages from TagInfo as values. Ex:

screen shot 2017-06-07 at 11 50 58 am