MAPC / arts-and-culture-asset-list

0 stars 0 forks source link

Merge Duplicate Sites #5

Open theryankelly opened 3 years ago

theryankelly commented 3 years ago

After Completion of #1 #2 #3 done we can look to merge the duplicate Sites. Will use Site Address Conflict and Dup Check to merge sites further. Expect it to be ~ 1,165 not the current 2207.

lperkinshigh commented 3 years ago

On 2/4 and 2/5, I ran two de-dupes on the Sites table. General process notes below:

  1. Deduplicated sites in Sites 1.0 (2,227 records) using the Site Name (To Edit) field. To do so, I made a copy of the Site Name (To Edit) field, changed that field to a linked field, linking to a new table, Sites 2.0. I then transferred field data from Sites 1.0 to Sites 2.0 via ROLLUP/'ARRAYUNIQUE'.
  2. I then reviewed conflicting address information (~40) in Sites 2.0 (rolled up values from the Spaces table) and either split the site (in the case where a conflicting address meant two buildings) or input the appropriate address.
  3. Deduplicated sites in Sites 2.0 (1,395) using the Full Address field. As with Sites 2.0, I created the values in the final table, Sites 3.0, by transferring field data from Sites 2.0 to Sites 3.0 via ROLLUP/'ARRAYUNIQUE.'
  4. I renamed Sites 3.0, Sites. This table has 1,290 records.