climate-mirror / datasets

For tracking data mirroring progress
201 stars 18 forks source link

Snow Telemetry (SNOTEL) #111

Open nickrsan opened 7 years ago

nickrsan commented 7 years ago

Name: Snow Telemetry (SNOTEL) Organization: NRCS Description URL: http://www.wcc.nrcs.usda.gov/snow/ Download URL: )https://wcc.sc.egov.usda.gov/nwcc/tabget (Data can be downloaded by subbing in state name, station ID, and variable in the following url: https://wcc.sc.egov.usda.gov/reportGenerator/view_csv/customSingleStationReport/daily/NA::SNTL%7Cid=%22%22%7Cname/POR_BEGIN,POR_END/PREC::value File Types: Flat files (.csv) Size: ~1 GB? Status:

johncronan commented 7 years ago

NWCC also has some instructions on scripted downloads here: http://www.wcc.nrcs.usda.gov/web_service/NWCC_Web_Report_Scripting.txt

I'm looking into this one.

elfplease commented 7 years ago

I'm looking into this one too.

johncronan commented 7 years ago

@elfplease I've finished coding this: the SNOTEL and SCAN data are downloading now. But there is also #41 - refer to my comment there.

elfplease commented 7 years ago

@johncronan I coded it too. It's mostly finished downloading. Public mirror is here: http://158.69.33.231/ My download script is here: http://158.69.33.231/snotel/crawl.sh

johncronan commented 7 years ago

@elfplease You used customSingleStationReport instead of raw data download as recommended in the NWCC notes (as a result, you don't get the full set of columns, since that varies by station).

And there is also hourly data.

I've got this one. You should note which issues have the in progress tag before you dive in!

elfplease commented 7 years ago

@johncronan The instructions here say it's still worthwhile downloading those which have only one copy. It looked like an interesting challenge so I thought I'd try it. I downloaded everything mentioned in the original description; looks like there is indeed more. If you share your download scripts then maybe others can use them too?

johncronan commented 7 years ago

@elfplease I guess it does say that.

I'll post the scripts tomorrow, after the hourly data has finished downloading.

nickrsan commented 7 years ago

Hi all,

Thank you for downloading data (both of you) - we ideally are aiming for at least two mirrors of each dataset 1) For redundancy and 2) For exactly the reason you both encountered, where each copy may not include the same data for copies not done by crawling simple pages or directories. Thank you!

johncronan commented 7 years ago

@nickrsan Okay, that makes good sense. :)

nwcc_scripts.tar.gz

johncronan commented 7 years ago

Mirror:

http://s3.ca-central-1.amazonaws.com/climate-mirror/www.wcc.nrcs.usda.gov/SNOTEL%2BSCAN/nwcc_scripts.tar.gz http://s3.ca-central-1.amazonaws.com/climate-mirror/www.wcc.nrcs.usda.gov/SNOTEL%2BSCAN/nwcc_daily.tar.bz2 (148 MB) http://s3.ca-central-1.amazonaws.com/climate-mirror/www.wcc.nrcs.usda.gov/SNOTEL%2BSCAN/nwcc_hourly.tar.bz2 (791 MB)

melari commented 7 years ago

Another mirror: https://drive.loft.hosting/s/x1pzTdA6LXRI9ME