GlobalFishingWatch / vessel-lists

DEPRECATED: See https://github.com/GlobalFishingWatch/treniformis
Apache License 2.0

Normalize Vessel Names #10

Closed by davidkroodsma 8 years ago

davidkroodsma commented 8 years ago

Write a BigQuery UDF that normalizes vessel names, to improve the current version of @Bjorn-skytruth's matching algorithm.

Examples of how the names should be normalized are here: https://docs.google.com/spreadsheets/d/1DZ_7VAbGS63wSRk8Yac00O0oTqgdFu66OnB_oFcvxOg/edit#gid=1459190414

davidkroodsma commented 8 years ago

The initial name normalization UDF is here: https://github.com/GlobalFishingWatch/vessel-lists/blob/master/clav_matching/name_normalize.js

It still needs work on international characters, but I am closing this issue for now.
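
For anyone picking up the international-character work, here is a minimal sketch of one possible approach. It is not the contents of `name_normalize.js`, and the rules are assumptions rather than the ones in the spreadsheet: strip diacritics via Unicode decomposition, uppercase, and treat everything outside A-Z and 0-9 as a separator. The function name `normalizeVesselName` is hypothetical, and the sketch assumes a JavaScript runtime that supports `String.prototype.normalize` (ES2015).

```javascript
// Hypothetical helper for illustration; not the UDF from this repository.
// Assumed rules: remove diacritics, uppercase, keep only A-Z and 0-9,
// and collapse every run of other characters into a single space.
function normalizeVesselName(name) {
  if (name === null || name === undefined) {
    return null; // pass missing names through unchanged
  }
  return String(name)
    .normalize('NFKD')                // split accented letters into base letter + combining mark
    .replace(/[\u0300-\u036f]/g, '')  // drop the combining marks
    .toUpperCase()
    .replace(/[^A-Z0-9]+/g, ' ')      // punctuation, symbols, and whitespace runs become one space
    .trim();
}

// Example: normalizeVesselName('  María-José  no. 5 ') returns 'MARIA JOSE NO 5'
```

One limitation of this sketch: letters with no Unicode decomposition (for example Ø) and names in non-Latin scripts are replaced by a space rather than transliterated, so those cases would need an explicit mapping table.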