inkenbrandt closed 10 years ago
Wasn't there a chance of duplication of id numbers? If we strip that WQX
value, and there is a duplicate, what needs to happen?
The stations having stationids containing "_WQX" are the most recent records, so, if possible, they should be the ones that are retained if duplicates are detected. However, I think it would be fine to delete either record (old or new), as the station information will essentially be the same.
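A rough sketch of that preference, assuming the station table is loaded as a list of dicts keyed by `StationId` (the function name `dedupe_stations` and the `Name` field in the example are made up for illustration, not from our scripts). When two records collapse to the same id, it keeps the one that originally carried "_WQX":

```python
def dedupe_stations(rows, field="StationId", tag="_WQX"):
    """Drop duplicate stations, preferring records whose id contained the tag.

    Hypothetical helper: rows is a list of dicts, one per station record.
    """
    kept = {}  # normalized id -> (came_from_wqx, normalized row)
    for row in rows:
        raw_id = row[field]
        key = raw_id.replace(tag, "")
        is_wqx = tag in raw_id
        # Keep the first record seen, but let a "_WQX" record replace an older one.
        if key not in kept or (is_wqx and not kept[key][0]):
            kept[key] = (is_wqx, {**row, field: key})
    return [row for _, row in kept.values()]


stations = [
    {"StationId": "UTAHDWQ-4946750", "Name": "old record"},
    {"StationId": "UTAHDWQ_WQX-4946750", "Name": "new record"},
]
deduped = dedupe_stations(stations)
```

Since the station information should be essentially the same either way, dropping the preference check and keeping whichever record comes first would also work.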
@inkenbrandt, is it safe to assume that duplicates exist only within their respective programs, and that no new duplicates will be created when merging all of the programs into one master location?
And does this only apply to the WQP program?
Yes, it is safe to assume that we will not create new duplicate station ids when merging, and yes, this issue of removing the "_WQX" string is only applicable to the WQP program.
We have to limit duplicates by standardizing the StationId values. Example: "UTAHDWQ-4946750" is the same as "UTAHDWQ_WQX-4946750". We need to create a script that searches through the StationId field (in both the results and station tables) and strips out the string "_WQX".
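Something along these lines might be a starting point. It is only a sketch: it assumes both tables are available as lists of dicts with a `StationId` key, and the helper names (`normalize_station_id`, `normalize_table`) and the `Param` field are placeholders rather than anything in the existing code.

```python
WQX_TAG = "_WQX"


def normalize_station_id(station_id):
    """Return the station id with the "_WQX" string stripped out."""
    return station_id.replace(WQX_TAG, "")


def normalize_table(rows, field="StationId"):
    """Return copies of the rows with the id field normalized (input is not modified)."""
    return [{**row, field: normalize_station_id(row[field])} for row in rows]


# Hypothetical sample rows standing in for the station and results tables.
stations = normalize_table([
    {"StationId": "UTAHDWQ-4946750"},
    {"StationId": "UTAHDWQ_WQX-4946750"},
])
results = normalize_table([
    {"StationId": "UTAHDWQ_WQX-4946750", "Param": "pH"},
])
```

After this step both station rows share the id "UTAHDWQ-4946750", so duplicate stations still have to be dropped separately.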