afsc-gap-products / gap_products

This repository supports code used to create tables in the GAP_PRODUCTS Oracle schema. These tables include the master production tables, tables shared with AKFIN, and tables publicly shared on FOSS.
https://afsc-gap-products.github.io/gap_products/
Creative Commons Zero v1.0 Universal
5 stars 5 forks source link

Add special projects and special projects collections table to oracle database #41

Open EmilyMarkowitz-NOAA opened 7 months ago

EmilyMarkowitz-NOAA commented 7 months ago

Data product requested: A few new tables in Oracle to store our special collections data, possibly living in RACE_DATA or RACEBASE. Open to new names for these tables, but I'll call them special_projects and special_project_collections for the time being. I've put examples for these new tables in the FUTURE_ORACLE google spreadsheet for us to play with.

I know this will require further discussion and coordination, but I wanted to formalize this idea with a GitHub issue for us. It is hugely problematic that we do not currently document what special projects we do/don't do or save the data collected for those special projects.

Other bigger picture considerations There are unintended consequences of bringing this data into Oracle that we will need to consider. I think while considering the below points, we should be careful to think of short, medium, and long term changes we need to make and goals to achieve. Not everything has to be done at once and we can make incremental progress on these ideas. Other issues/efforts that will need to be addressed include:

Other Tagging @zoyafuso-NOAA and https://github.com/afsc-gap-products/gap_products/issues/12 for awareness.

TLDR Short-term

Mid-term:

Long-term:

Ned-Laman-NOAA commented 7 months ago

I have also initiated a future database issue/concern with Autumn and OFIS so this is sitting in their parking lot when they have time to address it. It's our (GAP's) contention that database structure and infrastructure changes are solidly in OFIS' court to create and maintain. Remains to be seen if we win that argument in a timely fashion though!

Ned-Laman-NOAA commented 1 month ago

A few things to update here:

1 - GAP's databases (e.g., RACEBase and RACE_DATA) will now be maintained by GAP in the form of Chris Anderson. We will be able to tap into DB expertise from OFIS to advise us but developing databases and structures within our schemata will be our thing going forward. 2 - Alex D as the Special Collections Coordinator will need to weigh in on this process. She is developing/maintaining the RFP process and Google Forms so that makes sense for the front end and recent data. She and I are in the process of starting to map out how to recover historic data. 3 - I agree with Em that this is an important and needs to be addressed in the short term to support our data reports and to meet our obligations for storing and making some form of these data searchable and accessible. 4 - I would place digitizing the data from Special Collections in the Long Term goal category and maybe even Long Long Term. Actually, I may need to be convinced that we actually need to store the special collection data at all. It would be simpler for us to store just the particulars of a project and a count of how many samples were collected rather than the data themselves.