IMCR-Hackathon / Hackathon-Central-2019

Command center for 2019 hackathon participants to share ideas, coordinate teams, develop projects and access all logistics information
4 stars 1 forks source link

Datasets #10

Open CoastalPlainSoils opened 5 years ago

CoastalPlainSoils commented 5 years ago

Problematic dataset due to column names not displayed properly.

Screen Shot 2019-06-11 at 2 20 43 PM

https://portal.edirepository.org/nis/mapbrowse?packageid=knb-lter-bes.950.420

wetlandscapes commented 5 years ago

A typical data set: https://search.dataone.org/view/https://pasta.lternet.edu/package/metadata/eml/edi/216/1. Specifically: hydrology_data.csv

ktoddbrown commented 5 years ago

A Pangaea data set with extra header information but otherwise well formatted https://doi.pangaea.de/10.1594/PANGAEA.890471

jhp7e commented 5 years ago

Here is a typical dataset: https://pasta.lternet.edu/package/metadata/eml/knb-lter-vcr/153/24 - raw metadata https://doi.org/10.6073/pasta/7e48a6e1fb576a5be7b20ffbbaa10503 - landing page

And here is a problematic dataset - too many columns, lots of missing data, some range and unit issues. https://pasta.lternet.edu/package/metadata/eml/knb-lter-vcr/247/10 - raw metadata https://doi.org/10.6073/pasta/b650b236f092e0fdee0d5d8ccf521cb3 - landing page

CoastalPlainSoils commented 5 years ago

Dataset with code generation line (and R code....) https://portal.edirepository.org/nis/codeGeneration?packageId=knb-lter-bes.549.160&statisticalFileType=r

sheilasaia commented 5 years ago

Non-problematic LTER daily stream water chemistry dataset: https://cn.dataone.org/cn/v2/resolve/https%3A%2F%2Fpasta.lternet.edu%2Fpackage%2Fdata%2Feml%2Fknb-lter-knz%2F125%2F1%2Ff9f485f53fcbad9222e9f53394f9a826 (download link for PBG111.csv file). You can view the complete dataone entry at: https://search.dataone.org/view/https://pasta.lternet.edu/package/metadata/eml/knb-lter-knz/125/1.

Sample R code to download: metajam::download_d1_data(data_url = "https://cn.dataone.org/cn/v2/resolve/https%3A%2F%2Fpasta.lternet.edu%2Fpackage%2Fdata%2Feml%2Fknb-lter-knz%2F125%2F1%2Ff9f485f53fcbad9222e9f53394f9a826", path = "<your destination path>")

ktoddbrown commented 5 years ago

Very clean soils data: https://esajournals.onlinelibrary.wiley.com/doi/full/10.1002/ecy.2159

CoastalPlainSoils commented 5 years ago

Clean data table with code generation also.... https://portal.edirepository.org/nis/mapbrowse?packageid=knb-lter-bes.3130.100

vanderbi commented 5 years ago

Typical dataset : https://search.dataone.org/view/https://pasta.lternet.edu/package/metadata/eml/knb-lter-sev/289/239911 (Sevilleta LTER NPP Quadrat Data)

alesiahallmark commented 5 years ago

Flux tower data (fun because of the high temporal frequency): https://cn.dataone.org/cn/v2/resolve/https%3A%2F%2Fpasta.lternet.edu%2Fpackage%2Fdata%2Feml%2Fknb-lter-hbr%2F241%2F2%2Fb05f44eacc901906ac386b1bbb0414a4

Line-intercept data (simple but strange spatial data): https://cn.dataone.org/cn/v2/resolve/https%3A%2F%2Fpasta.lternet.edu%2Fpackage%2Fdata%2Feml%2Fknb-lter-sev%2F4%2F167418%2Fb702fe7865ddefe3d28061131a50434a

Spatially distributed data, with two data tables, one for data, one for sites: https://search.dataone.org/view/https://pasta.lternet.edu/package/metadata/eml/knb-lter-cap/41/14

atn38 commented 5 years ago

I advocate for datasets with explicit spatial and temporal columns :)

Subsistence and personal use harvest of salmon in Alaska, 1960-2012 https://search.dataone.org/view/doi:10.5063/F18P5XTN

lkuiucsb commented 5 years ago

Three data tables, long time series, multiple measurements https://portal.edirepository.org/nis/mapbrowse?scope=knb-lter-sbc&identifier=45

ktoddbrown commented 5 years ago

This is now here 121a7311a7c971e4862cae1b1f1872e2240e3cf5

And here is a problematic dataset - too many columns, lots of missing data, some range and unit issues. https://pasta.lternet.edu/package/metadata/eml/knb-lter-vcr/247/10 - raw metadata https://doi.org/10.6073/pasta/b650b236f092e0fdee0d5d8ccf521cb3 - landing page

ktoddbrown commented 5 years ago

This is now here: R/testData_SavilletaNPP.R @vanderbi

Typical dataset : https://search.dataone.org/view/https://pasta.lternet.edu/package/metadata/eml/knb-lter-sev/289/239911 (Sevilleta LTER NPP Quadrat Data)

clnsmth commented 5 years ago

Here's the example dataset for read_data_archived(). An Arctic Data Center data package