Closed ian-r-rose closed 4 months ago
I'm thinking of using snowsql to do that. Approach 1: a sql query to drop the samples with duplications. Approach 2: create a new table, so the deduped data will be in the new table, rather than in-place removal as Approach 1.
What's your thoughts? @ian-r-rose
Closing as completed by #304
The 30-second data relay data may sometimes contain duplicate sample data, especially if we are recovering from incidents or backfilling.
We should insert some logic into the staging model to deal with this possibly-duplicated-data.