Open CoastalPlainSoils opened 5 years ago
A typical data set: https://search.dataone.org/view/https://pasta.lternet.edu/package/metadata/eml/edi/216/1. Specifically: hydrology_data.csv
A Pangaea data set with extra header information but otherwise well formatted https://doi.pangaea.de/10.1594/PANGAEA.890471
Here is a typical dataset: https://pasta.lternet.edu/package/metadata/eml/knb-lter-vcr/153/24 - raw metadata https://doi.org/10.6073/pasta/7e48a6e1fb576a5be7b20ffbbaa10503 - landing page
And here is a problematic dataset - too many columns, lots of missing data, some range and unit issues. https://pasta.lternet.edu/package/metadata/eml/knb-lter-vcr/247/10 - raw metadata https://doi.org/10.6073/pasta/b650b236f092e0fdee0d5d8ccf521cb3 - landing page
Dataset with code generation line (and R code....) https://portal.edirepository.org/nis/codeGeneration?packageId=knb-lter-bes.549.160&statisticalFileType=r
Non-problematic LTER daily stream water chemistry dataset: https://cn.dataone.org/cn/v2/resolve/https%3A%2F%2Fpasta.lternet.edu%2Fpackage%2Fdata%2Feml%2Fknb-lter-knz%2F125%2F1%2Ff9f485f53fcbad9222e9f53394f9a826 (download link for PBG111.csv file). You can view the complete dataone entry at: https://search.dataone.org/view/https://pasta.lternet.edu/package/metadata/eml/knb-lter-knz/125/1.
Sample R code to download: metajam::download_d1_data(data_url = "https://cn.dataone.org/cn/v2/resolve/https%3A%2F%2Fpasta.lternet.edu%2Fpackage%2Fdata%2Feml%2Fknb-lter-knz%2F125%2F1%2Ff9f485f53fcbad9222e9f53394f9a826", path = "<your destination path>")
Very clean soils data: https://esajournals.onlinelibrary.wiley.com/doi/full/10.1002/ecy.2159
Clean data table with code generation also.... https://portal.edirepository.org/nis/mapbrowse?packageid=knb-lter-bes.3130.100
Typical dataset : https://search.dataone.org/view/https://pasta.lternet.edu/package/metadata/eml/knb-lter-sev/289/239911 (Sevilleta LTER NPP Quadrat Data)
Flux tower data (fun because of the high temporal frequency): https://cn.dataone.org/cn/v2/resolve/https%3A%2F%2Fpasta.lternet.edu%2Fpackage%2Fdata%2Feml%2Fknb-lter-hbr%2F241%2F2%2Fb05f44eacc901906ac386b1bbb0414a4
Line-intercept data (simple but strange spatial data): https://cn.dataone.org/cn/v2/resolve/https%3A%2F%2Fpasta.lternet.edu%2Fpackage%2Fdata%2Feml%2Fknb-lter-sev%2F4%2F167418%2Fb702fe7865ddefe3d28061131a50434a
Spatially distributed data, with two data tables, one for data, one for sites: https://search.dataone.org/view/https://pasta.lternet.edu/package/metadata/eml/knb-lter-cap/41/14
I advocate for datasets with explicit spatial and temporal columns :)
Subsistence and personal use harvest of salmon in Alaska, 1960-2012 https://search.dataone.org/view/doi:10.5063/F18P5XTN
Three data tables, long time series, multiple measurements https://portal.edirepository.org/nis/mapbrowse?scope=knb-lter-sbc&identifier=45
This is now here 121a7311a7c971e4862cae1b1f1872e2240e3cf5
And here is a problematic dataset - too many columns, lots of missing data, some range and unit issues. https://pasta.lternet.edu/package/metadata/eml/knb-lter-vcr/247/10 - raw metadata https://doi.org/10.6073/pasta/b650b236f092e0fdee0d5d8ccf521cb3 - landing page
This is now here: R/testData_SavilletaNPP.R @vanderbi
Typical dataset : https://search.dataone.org/view/https://pasta.lternet.edu/package/metadata/eml/knb-lter-sev/289/239911 (Sevilleta LTER NPP Quadrat Data)
Here's the example dataset for read_data_archived()
. An Arctic Data Center data package
Problematic dataset due to column names not displayed properly.
https://portal.edirepository.org/nis/mapbrowse?packageid=knb-lter-bes.950.420