nwfsc-fram / warehouse

FRAM Data Warehouse - Public
Other
5 stars 1 forks source link

Add Ageing_Facility to the individual_fact table in the DW and fill it for legacy data. #71

Open BHHorness-NOAA opened 4 years ago

BHHorness-NOAA commented 4 years ago

There can be biases between labs that read specimens for age determinations. This can be an important parameter for some assessed fish species, but has not been included to date in the data warehouse although available in all trawl survey legacy databases. A request was made by Chantel Wetzel to include this parameter.

BHHorness-NOAA commented 4 years ago

Transferred ageing lab id into staging database. Note that for some years prior to 2016 the meta data for age reads was added but the age result itself was not stored in the results table, probably to avoid duplication with the individual table. Curiously, in the later years (starting in 2010? and through 2015) the duplication of age data was introduced. Pentaho transformation was edited and rerun in the staging database. One step remains before pushing to prod: Age data for which the ageing facility is currently unknown needs to be set to -1 = Missing/Unknown. It is possible that if someone has the time and inclination, that Patrick McDonald can dig out this information to actively fill these gaps. Note also that it was discovered through the course of this effort that a considerable number of 2016 and 2017 age reads haven't been pushed to the DW (they do exist in the fram_central database). After this initial push for the ageing facility id, the transformation should be updated to push all applicable age data (the age, ageing date, and facility) to the DW (2016/2017 ages).

SaOgaz-NOAA commented 4 years ago

Hi @BHHorness-NOAA, what's the status on this one?