HakaiInstitute / hakai-datasets

Hakai Datasets that are going into https://catalogue.hakai.org/erddap/

Dataset Submission: Synthesized Nutrient Dataset associated with research paper by Hayley Dosser #64

Closed raytula closed 2 years ago

raytula commented 2 years ago

Hakai Dataset Submission

The intention of this issue is to make a nutrient dataset, created and used by Hayley as part of a research paper, Findable and Accessible via the Hakai metadata catalogue. Similar to other synthesized or paper-specific datasets, this dataset includes data from multiple sources that have been aggregated and processed in ways specific to a particular research project or paper.

Related examples include:

Below are listed all the different steps related to the initial submission of a dataset.

A more detailed written and visual description of every step is available respectively here and here.

Submission steps

Initial Submission (Data Administrator)

Online Dataset Creation (Data Integrator)

Dataset Review (Data Administrator)

Dataset Completion (Data Integrator)

raytula commented 2 years ago

Hi @hvdosser [cc @jenjax2 @timvdstap @JessyBarrette ] I created this issue to document and track the steps required to make your Nutrient dataset available online and referenced by a DOI. I will speak with others early next week to see who is available to help you with some of the technical steps. However, you are welcome to get started on this anytime. In particular:

Once the metadata form is complete, including links to the related data and other files in Google Drive or GitHub, it will be published to the Hakai metadata catalogue (like the OA dataset), and a DOI can be created that will link to the published record.

Ray

JessyBarrette commented 2 years ago

@hvdosser Hi Hayley! I hope things are going well for you! Let me know if you have any questions regarding any of the points given by Ray above. I can help you get all this sorted out!

Best regards, Jessy

timvdstap commented 2 years ago

Hi Hayley!

I'll echo @JessyBarrette - happy to help where I can!

Cheers, Tim

hvdosser commented 2 years ago

Hi Jessy,

Thanks! Unfortunately, I'm still not completely sure what the plan is. If possible, I would like to get a DOI for the existing Hakai Institute nutrient dataset. This would 1) avoid the need to go through the whole process again for the subset of samples I used and 2) be consistent with the link we provided the journal for the full DFO nutrient dataset available on CIOOS. If this is a possibility, what would be my next steps?

Cheers, Hayley

jenjax2 commented 2 years ago

Hi @hvdosser and @JessyBarrette. After talking to @raytula, it seems that making one large synthesis product of all of the data you used (DFO and Hakai, both CTD and nutrients) may make the most sense.

raytula commented 2 years ago

Hi @hvdosser

My understanding is that your paper is based on Hakai and DFO nutrient data from a broad geographic area and over a relatively long time period. If this is correct, then the Hakai Research data only includes a subset of the Hakai data used for your paper, as it currently only includes research-ready data from one site and for a relatively short period of time. Regarding the DFO nutrient data, the CIOOS link you provided is subject to change, so it is not really appropriate for publication. Also, we are not yet in a position where we are generating DOIs (i.e. permanent links) to non-Hakai datasets shared via CIOOS (but that may come). So, from a technically focused perspective, I believe that sharing a synthesis dataset that includes all of the data used for the paper is the best approach. In the future, if your research is based on a subset of data that can be accessed via existing DOIs, then using those DOIs would likely be a better approach, but we're not there yet.

@JessyBarrette and @timvdstap can help you through the process of completing the metadata record, making your data available online and generating a DOI.

Cheers, Ray

hvdosser commented 2 years ago

Hi Ray,

Thank you for the help and clarification!

I was under the impression (based on instructions from Charles Hannah) that the permanent links provided through CIOOS would remain fixed, so it's good to know that's not the case. I also very much appreciate Jessy and Tim's help, as I imagine the metadata record for the entire combined dataset will be substantial. I'm going to confirm with my co-authors that everyone is comfortable with this approach, and then we can get this done!

Thank you again, Hayley

hvdosser commented 2 years ago

Hi Ray,

Having spoken with my co-authors (and particularly with Charles Hannah about the DFO data), I think the preferred approach is to use the CIOOS links for everything except the Hakai nutrient data, and create a new DOI for only those data. He recognized that the CIOOS links would change, but feels that it's the best we can do for now (and will keep my paper consistent with other publications from DFO scientists). So let's move forward with getting a link for the Hakai nutrient data we used. I'll start on the metadata form later today.

Thanks so much for your help sorting this out! Cheers, Hayley

raytula commented 2 years ago

Hi @hvdosser, that approach sounds good to me... certainly simpler. Thanks, Ray

JessyBarrette commented 2 years ago

Hayley's Hakai Nutrient data is now available within this repository: https://github.com/HakaiInstitute/Dosser-et-al2021-hakai-nutrient-dataset

JessyBarrette commented 2 years ago

The metadata record for the Hayley et al. 2021 paper is available here: https://cioos-siooc.github.io/metadata-entry-form/#/en/hakai/oOseiEWez6TzqPUIisEXczg6G8y2/-MoZtzH-n9LBy9Bn29Hl

I'm wondering about the paper's coauthors: should they also be added to the contact list? @jenjax2 @raytula let me know your thoughts.

raytula commented 2 years ago

Re: whether the paper's coauthors should also be added to the contact list: only if they were involved in preparing the dataset.

JessyBarrette commented 2 years ago

@hvdosser a DOI is now available for your dataset: https://doi.org/10.21966/j3j5-wt70

I updated the metadata form to include it; it should be available on the CKAN record itself by tomorrow.

hvdosser commented 2 years ago

Hi Jessy,

Thank you, that's great!

Hayley

JessyBarrette commented 2 years ago

I will close this issue since the dataset, metadata record and DOI are now all available!