Closed davidkroodsma closed 8 years ago
Initial name normalization UDF function is here: https://github.com/GlobalFishingWatch/vessel-lists/blob/master/clav_matching/name_normalize.js
It needs work for the international characters, but for now I am closing this issue.
Write a UDF function in BigQuery to normalize the vessel names to improve the current version of @Bjorn-skytruth's matching algorithm.
Examples of how the names should be normalized are here: https://docs.google.com/spreadsheets/d/1DZ_7VAbGS63wSRk8Yac00O0oTqgdFu66OnB_oFcvxOg/edit#gid=1459190414