climate-mirror / datasets

For tracking data mirroring progress
201 stars 18 forks source link

Dataset at ftp:/airbornescience.nsstc.nasa.gov/grip/HIRAD #133

Open nickrsan opened 7 years ago

nickrsan commented 7 years ago

ftp:/airbornescience.nsstc.nasa.gov/grip/HIRAD.

Suggested in a large email containing many urls

stevelord commented 7 years ago

There's no grip folder on this FTP server. An ls of the / folder reveals:

drwxr-xr-x 22 ftp ftp 23 Jan 24 18:55 . drwxr-xr-x 6 ftp ftp 7 Jun 24 2016 .. drwxr-xr-x 6 ftp ftp 6 Jun 24 2014 amsre drwxr-xr-x 5 ftp ftp 6 Jun 29 2015 amsu-a drwxr-xr-x 4 ftp ftp 4 Mar 12 2013 amsutemps drwxr-xr-x 8 ftp ftp 8 Jun 29 2015 browse drwxr-xr-x 6 ftp ftp 6 Dec 7 21:58 browse_sample drwxr-xr-x 3 ftp ftp 3 May 25 2016 case_studies drwxr-xr-x 11 ftp ftp 11 Mar 1 2016 data drwxr-xr-x 36 ftp ftp 36 Oct 25 16:31 doc drwxr-xr-x 11 ftp ftp 11 Nov 2 21:46 fieldCampaigns drwxr-xr-x 3 ftp ftp 4 Aug 7 2013 globalir drwxr-xr-x 4 ftp ftp 4 Aug 28 2014 lis drwxr-xr-x 6 ftp ftp 6 Feb 2 2015 lma drwxr-xr-x 6 ftp ftp 6 Jan 25 2016 msu drwxr-xr-x 5 ftp ftp 5 Dec 4 2014 rasi lrw-r--r-- 1 ftp ftp 10 Nov 14 2011 sandbox -> ../sandbox drwxr-xr-x 8 ftp ftp 8 Jul 26 2012 ssmi drwxr-xr-x 4 ftp ftp 4 Sep 21 2012 ssmis drwxr-xr-x 4 ftp ftp 4 Jun 7 2013 storms drwxr-xr-x 3 ftp ftp 3 Dec 10 2015 tcpf drwxr-xr-x 4 ftp ftp 4 Apr 26 2016 tpw drwxr-xr-x 4 ftp ftp 4 Apr 26 2016 wind_climatology

donbright commented 7 years ago

Appears to be behind a username/password wall

http://airbornescience.nsstc.nasa.gov/grip

https://ghrc.nsstc.nasa.gov/hydro/search.pl?hydro&pr=GRIP

https://ghrc.nsstc.nasa.gov/uso/ds_docs/grip/griphirad/griphirad_dataset.html


confirmed. once you sign up for an earthdata username password, you can access the https server with this url

https://fcportal.nsstc.nasa.gov/pub/grip/

it looks like this might be the same data as in the ftp url

You can browse this parent directory, as shown below, but when you 'drill down' clicking into the 'data' subdirectories you get redirected to an Earthdata login (not pictured).

delme

Also you have to set your browser to temporarily ignore their SSL certificate problems.

donbright commented 7 years ago

i attempted to mirror this at

offline, contact me if data is needed

hashdeep: offline, contact me if data is needed hashdeep signature: offline, contact me if data is needed

but im not sure it if worked. It is only about 1.1 Gigabyte. following these tips on linux:

https://wiki.earthdata.nasa.gov/display/EL/How+To+Access+Data+With+cURL+And+Wget


cd ~
touch .netrc
echo "machine urs.earthdata.nasa.gov login <uid> password <password>" > .netrc
chmod 0600 .netrc
cd ~
touch .urs_cookies

curl  -k -b ~/.urs_cookies -c ~/.urs_cookies -L -n https://fcportal.nsstc.nasa.gov/pub/grip/HIRAD/v1/data/

the curl line is only to authenticate Earthdata username/password

then i used wget to do the fetch

wget -r -np --user=xxxxx --password=xxxxx  --no-check-certificate --load-cookies ~/.urs_cookies  https://fcportal.nsstc.nasa.gov:/pub/grip/HIRAD/
wantonwonton commented 7 years ago

@donbright Using wget's -r option defaults to only 5 levels of recursion, so is it possible that your download didn't get all the files? You might want to retry with some other options (perhaps including -N, to not download any file again if its timestamp hasn't changed).

I just posted some notes on wget options here.

donbright commented 7 years ago

thanks for showing me that, @wantonwonton i re-ran with wget -m -np and there only difference was a handful of strange files like ./pub/grip/HIRAD/v0/data/EARL/tb/index.html?C=M;O=D: , but they are all at the same depth.

i will check my other wgets and make sure they were correct.