When webcasters are exporting audio, they hand type the minister's name. This is error prone, leading to misspelled names being displayed on the public facing archives. The spreadsheet-updater is downstream from the actual MP3 creation, as such it has no control over source data.
Proposed Solution
When parsing minister name from the MP3 tag, compare it to a master list of ministers; pick the minister's name that is the most similar. There are various algorithms for finding string similarity. See Grails' implementation of CosineSimilarity for an example.
Bonus Points
Extra nice if the master list is maintainable by LLC (perhaps another google spreadsheet?)
Situation
When webcasters are exporting audio, they hand type the minister's name. This is error prone, leading to misspelled names being displayed on the public facing archives. The spreadsheet-updater is downstream from the actual MP3 creation, as such it has no control over source data.
Proposed Solution
When parsing minister name from the MP3 tag, compare it to a master list of ministers; pick the minister's name that is the most similar. There are various algorithms for finding string similarity. See Grails' implementation of CosineSimilarity for an example.
Bonus Points
Extra nice if the master list is maintainable by LLC (perhaps another google spreadsheet?)