plantphys / gsti

A project focused on the development of generalized spectra-trait models for the prediction of leaf photosynthetic capacity. This includes models focused on the prediction of leaf nitrogen, leaf mass per area (LMA), leaf water content (LWC), Vcmax, Jmax and dark respiration.
GNU General Public License v3.0
6 stars 1 forks source link

Develop data curation pipeline #2

Closed serbinsh closed 1 year ago

serbinsh commented 2 years ago

Rough example here:

https://github.com/serbinsh/SSerbin_NGEEArctic_Spectra_Trait

serbinsh commented 2 years ago

Steps [rough] for curation:

  1. Identify datasets & source URL
  2. Pull in "raw" data
  3. Identify variables and units
  4. Curate raw data into GVP standard format (units, variable names, etc) - base on gasex standard
  5. Generate QA/QC for each dataset (auto+manual). Spec quality, gasex quality, other flags
  6. Combine curated data with larger dataset
serbinsh commented 2 years ago

We need to create an issue for each new dataset so we can add them to the project board @JulienLamour so perhaps dataset, citation, source in the issue