NBMG-UNR / resourcespace-migration

Code and tasks related to migrating GBSSRL into a CMS
0 stars 1 forks source link

Air Photos #2

Open emilyodean opened 4 years ago

emilyodean commented 4 years ago

We have a TON of air photos which all have different indices. We need to come up with a reasonable way to combine all of these into one spreadsheet. This should be done in Python.

We should maintain important information from each separate index (identifiers, dates, source, if there's any location information, etc) but also should add a unified ID to the new spreadsheet that combines all of these. I'd like to add an ID for physical location in our warehouse as well.

emilyodean commented 4 years ago

NVAerials2.xlsx NVAerials3.xlsx NVAerials4.xlsx NVAerials7.xlsx SCS_EXTRA_PHOTOS.xlsx aerial_photos_index_2.xlsx AERIALPH.xlsx AerialPhotoData21.xlsx

emilyodean commented 4 years ago

@lweeks20 this can be done after #3 and #1

emilyodean commented 4 years ago

@lweeks20 please work on this tomorrow. Information is provided above. Combine the given spreadsheets into one spreadsheet with common column names. The column names are detailed and basic code is started here: https://github.com/NBMG-UNR/resourcespace-migration/blob/master/clean_airphotos.

emilyodean commented 4 years ago

NVAerialsUSGS03032020.xlsx

emilyodean commented 4 years ago

@lweeks20, we'll chat tomorrow morning but I also wanted to give you some written information on this ticket. We just got 11 boxes of air photos from USGS which are at the top of the back stairs at GBSSRL. The spreadsheet above corresponds to the boxes, and each box has info written on it about its contents and which sheet within the spreadsheet contains the corresponding metadata.

I need you to go through this spreadsheet and relate it to the master air photos sheet that you generated from all of disparate spreadsheets that we had. We need to identify if they sent us duplicates of any that we already have in our collection, and if there are duplicates we need to identify which duplicate is in better shape. If there are new air photos that we don't yet have in our collections, we need to separate those out so we can formally accession them.

I'm going to leave the process of how to do the above up to you, but I would like an update at the end of each workday, so I'd recommend making a spreadsheet of which boxes you've gone through, which air photos are duplicates, and which ones are unique.

lweeks20 commented 4 years ago

Here's our master list updated_aerial_photos_02132020.xlsx