Open MattReimer opened 1 year ago
@MattReimer , many of our duplicates I think we need to delete don't meet all your criteria. Specifically they are :
That is because they represent us figuring shit out with our iteration on things like VBET, RCAT, etc. Everything else you are saying is relevant, but we want to initially populate the warehouse with one (not multiple) versions of our production grade tools. Once those BLM, NRCS deliverables are complete, then your definition of a duplicate is much more relevant.
Understood. These criteria can be loosened and tightened but the base query remains relevant.
The ones that have the same version are an easy delete though so that's why I started there.
Let me preface this ticket by saying "BE VERY VERY CAREFUL!" Measure twice, cut once!
Sometimes through weird queueing mistakes we end up with projects that are duplicates of one another
A duplicate is defined as:
Example
https://warehouse.riverscapes.net/s?type=Project&geo=-91.9624074265721%2C30.40674608731412%2C8.766197736186141
You can use the new Cybercastor Sqlite3 dump script to pull all the project metadata from the warehouse into a bitesize package
Here's what ChatGPT gave me for finding duplicates while leaving the most current project out of the list
@philipbaileynar maybe you could check this carefully before we use it:
Concerns
Deleting projects graphql
INPUTS
Sample SQL output