climate-mirror / datasets

For tracking data mirroring progress

NCDC Global Databank #317

Open JeremiahCurtis opened 7 years ago

JeremiahCurtis commented 7 years ago

https://www1.ncdc.noaa.gov/pub/data/globaldatabank/

estimated size: 232 GB

"The Global Land Surface Databank combines NCDC’s Global Historical Climatology Network–Daily dataset, which includes over 27,000 stations as its foundation, with other data sources from NCDC and those collected by ISTI to produce an integrated databank with approximately 32,000 station records. Additionally, the Global Land Surface Databank provides users with a way to completely track the origin of surface air temperature data from their earliest available source through their integration into the databank." description from https://www.ncdc.noaa.gov/news/release-global-land-surface-databank

see issue #162

ewooonk commented 7 years ago

I'm downloading this using:

wget -m --page-requisites --adjust-extension --no-parent --convert-links -e robots=off https://www1.ncdc.noaa.gov/pub/data/globaldatabank/

ewooonk commented 7 years ago

ftp://ftp.ncdc.noaa.gov/pub/data/globaldatabank

Ichimonji10 commented 7 years ago

Downloading with:

wget \
  --mirror \
  --output-file=ftp.ncdc.noaa.gov.log \
  --no-verbose \
  --limit-rate=2m \
  ftp://ftp.ncdc.noaa.gov/pub/data/globaldatabank

ewooonk commented 7 years ago

I have an offline copy of this dataset.

Ichimonji10 commented 7 years ago

I have an offline copy of the dataset, too. Some quick numbers:

$ du -h --max-depth=1
235G    ./ftp.ncdc.noaa.gov
235G    .
$ find ftp.ncdc.noaa.gov -type f | wc
  37482   37657 3529271

Download log and hashdeep output available at http://www.ichimonji10.name/climatemirror/ftp.ncdc.noaa.gov.pub_data_globaldatabank/
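
For anyone who wants to check their own copy against a manifest, here is a rough sketch of the idea. It is not the hashdeep invocation used for the file above: it swaps in sha256sum from GNU coreutils, and the function names and the manifest file name are made up for illustration. It also assumes GNU find, sort and xargs.

```shell
#!/bin/sh
# Sketch only: sha256sum stands in for hashdeep here (an assumption,
# not the tool used for the published hash files).

# write_manifest DIR MANIFEST
# Hash every regular file under DIR, with paths recorded relative to DIR.
# Keep MANIFEST outside DIR so it does not end up listing itself.
write_manifest() {
  ( cd "$1" && find . -type f -print0 | sort -z | xargs -0 -r sha256sum ) > "$2"
}

# audit_manifest DIR MANIFEST
# Re-hash the files listed in MANIFEST and compare.
# Exits non-zero if any listed file is missing or has changed.
audit_manifest() {
  ( cd "$1" && sha256sum --check --quiet ) < "$2"
}

# Example (hypothetical manifest name):
#   write_manifest ftp.ncdc.noaa.gov globaldatabank.sha256
#   audit_manifest ftp.ncdc.noaa.gov globaldatabank.sha256
```

Note that sha256sum --check only looks at files named in the manifest, so files added to the mirror afterwards go unnoticed.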

ewooonk commented 7 years ago

Hashdeep: https://dl.dropboxusercontent.com/u/8962137/noaa_global_databank_hash_audit.txt

ewooonk commented 7 years ago

I still have a separate offline copy. Mirror: https://wageningenur4-my.sharepoint.com/personal/ewout_oonk_wur_nl/_layouts/15/guestaccess.aspx?folderid=0cb65881bf751479c8eded688aadeb450&authkey=AXlON4fJDy9i3579X1jG6i8