HakaiInstitute / hakai-ctd-qc

Series of tests applied to the Hakai CTD profile data based on the QARTOD tests and other Hakai Specific ones.
0 stars 0 forks source link

Hakai Profile QC initial deployment #12

Closed JessyBarrette closed 2 months ago

JessyBarrette commented 1 year ago

Hakai Profile QC initial deployment

The hakai profile qc tool has been for a while used to generate the research and provisional datasets made available on the Hakai ERDDAP by generating NetCDF files. We need to include the qc results within the hakai database.

Any errors are tracked through Sentry: https://sentry.io/organizations/hakai-institute/projects/ctd-auto-qc/?project=6685251

Initial deployment steps on development environment

Deployment on production server

Related issues

The present issues are just present here to help track but do not need to get resolved prior to the deployment

13

14

JessyBarrette commented 1 year ago

A full QC of the hakaidev CTD profiles was completed last night with sucess and suggest no issues. The whole process was completed in ~6hours (~7000 profiles).

JessyBarrette commented 1 year ago

@fostermh I started populating the hakaidev CTD profiles schema with the automated qc flags generated by this repo.

All the flags are now made available on the goose.hakai.org/portal

I'm wondering if you are aware of any issues related to those latest changes? As far as I can see everything seems good and manage to handle all those flags accordingly.

JessyBarrette commented 1 year ago

Well unfortunately the dry run ended up not been a dry run due to a bad handling of the environment variables. QC flags are now already available on hakai db ctd schema for a major part of the CTD dataset.

fostermh commented 1 year ago

fun

JessyBarrette commented 1 year ago

Another dry run was completed on all the qced and unqced data ~10k profiles which last over 4 hours. We omitted for now the manually already qced data. This makes me wonder if want to add flags to those specific profiles. My first thought is yes we want that.

JessyBarrette commented 1 year ago

Since a major part of the flags are now already available the hakai ctd, @fostermh do you see any major issues, or I'm good to roll out the whole process?

fostermh commented 1 year ago

do you have an example file / cast I could look at that has had qc run on it in production?

JessyBarrette commented 1 year ago
Sure: hakai_id cause
01907467_2015-06-10T15:54:38Z instrument hit bottom lowest data is flagged
80217_2021-04-08T16:10:59.000Z density inversion and bottom hit
080217_2019-09-04T17:36:45.166Z cap was left on the oxygen sensor which is making the whole profile bad

You will see those in the _flag and _flag_level_1 columns

looking at this now I'm realizing that some of the thresholds are too sensitive for the bottom hit detection and do_cap. The tests will need to be refined in the near future.

fostermh commented 1 year ago

it seems ok but the level 1 flags are not making it to the excel or csv downloads at the moment so I will need to sort that out

JessyBarrette commented 1 year ago

Yeah I added that as an issue here #14 not sure if we really want to add it to the portal output. People can still retrieve if through the api or ERDDAP.

JessyBarrette commented 1 year ago

All the deployment is now pretty much there. Only the cron job to hecate is left and will left for the new year.

JessyBarrette commented 2 months ago

This is an old issue that should have been closed a while back