Open JeremiahCurtis opened 7 years ago
I estimate this is about 21GB based on 3-6MB per day for 13 years.
~Running du-h
via lftp now. will update with file size and start downloading if it's small enough my my machine.~
Actually, I'm already hammering this server with another lftp du -h, i'll leave it be until that one finishes.
It's so many directories and small files that I ran du
on a wide sample, found pretty consistent results and extrapolated.
Cool, I'm gonna trust your extrapolation and just start downloading since the du -h
is still running
Download complete.
find podaac-ftp.jpl.nasa.gov/allData/windsat/ -type f -exec md5sum {} \; | sort -k 2 | md5sum
returns
0785732d2d84369ef5c6cee16a10f961
What was the size?
Looks like it's about 17GB
Mirroring now. I'll produce a torrent when it's done.
Mirroring
@markuslaker If you make the torrent directly from the mirrored directory structure (without zipping up the entire directory structure), I think anyone who has mirrored the source could mirror your torrent without needing to download the entire torrent again. Though before doing so, we would want to verify that the source has been mirrored correctly via hashes.
@axlecrusher, thanks for the hint. I'll do that.
Completed though my md5sum is the following
find podaac-ftp.jpl.nasa.gov/allData/windsat/ -type f -exec md5sum {} \;| grep -v \\.listing | sort -k 2 | md5sum df311d07ebf6e13fb18498255a9ae164 -
I just noticed that this directory is still being updated daily. That would account for the md5sum mismatch.
It took a while to download, but the magnet link is below. It covers a directory tree, as @axlecrusher suggested; I've not zipped up the data.
No one has used the magnet link I posted under Issue 293, and so I'm not sure I'm doing any real good here. Is it possible to get these magnet links posted somewhere more visible, so that the data can start to be distributed more widely?
magnet:?xt=urn:btih:BMAXR6SCAV7ACQ3SFKQTECXYRDSKKC4K&dn=ftp_podaac-ftp_jpl_nasa_gov_allData_windsat&tr=udp%3a%2f%2ftracker.coppersurfer.tk%3a6969%2fannounce&tr=udp%3a%2f%2ftracker.leechers-paradise.org%3a6969%2fannounce&tr=udp%3a%2f%2ftracker.zer0day.to%3a1337%2fannounce&tr=udp%3a%2f%2ftracker.opentrackr.org%3a1337%2fannounce&tr=http%3a%2f%2ftracker.opentrackr.org%3a1337%2fannounce&tr=udp%3a%2f%2fp4p.arenabg.com%3a1337%2fannounce&tr=http%3a%2f%2fp4p.arenabg.com%3a1337%2fannounce&tr=udp%3a%2f%2f9.rarbg.com%3a2710%2fannounce&tr=udp%3a%2f%2fexplodie.org%3a6969%2fannounce&tr=http%3a%2f%2fexplodie.org%3a6969%2fannounce&tr=http%3a%2f%2ftracker.dler.org%3a6969%2fannounce&tr=udp%3a%2f%2ftracker.internetwarriors.net%3a1337%2fannounce&tr=udp%3a%2f%2ftracker1.wasabii.com.tw%3a6969%2fannounce&tr=udp%3a%2f%2ftracker2.wasabii.com.tw%3a6969%2fannounce&tr=udp%3a%2f%2ftracker.mg64.net%3a6969%2fannounce&tr=udp%3a%2f%2fmgtracker.org%3a6969%2fannounce&tr=udp%3a%2f%2fmgtracker.org%3a2710%2fannounce&tr=udp%3a%2f%2fipv4.tracker.harry.lu%3a80%2fannounce&tr=http%3a%2f%2ftracker1.wasabii.com.tw%3a6969%2fannounce&tr=http%3a%2f%2fmgtracker.org%3a6969%2fannounce&tr=http%3a%2f%2ftracker2.wasabii.com.tw%3a6969%2fannounce&tr=http%3a%2f%2ftracker.sktorrent.net%3a6969%2fannounce&tr=http%3a%2f%2ftracker.mg64.net%3a6881%2fannounce&tr=http%3a%2f%2fipv4.tracker.harry.lu%3a80%2fannounce&tr=udp%3a%2f%2ftracker.vanitycore.co%3a6969%2fannounce&tr=udp%3a%2f%2ftracker.baravik.org%3a6970%2fannounce&tr=udp%3a%2f%2fbt.xxx-tracker.com%3a2710%2fannounce&tr=http%3a%2f%2ftracker.vanitycore.co%3a6969%2fannounce&tr=http%3a%2f%2ftracker.baravik.org%3a6970%2fannounce&tr=udp%3a%2f%2ftracker.kamigami.org%3a2710%2fannounce&tr=udp%3a%2f%2ftracker.grepler.com%3a6969%2fannounce&tr=udp%3a%2f%2ftracker.filetracker.pl%3a8089%2fannounce
Definitely not 17GB or 21GB...I'm getting 39GB and still downloading for windsat. In my logs, it seems like there is a .snapshot folder in many of the subfolders. I think your original estimate didn't take these hidden folders into account. My wget --mirror is downloading these anyways, although I don't know if .snapshot has duplicates.
UPDATE: Yah. .snapshot has duplicates, so no need to mirror that.
Seems like this data is updated daily. Same exact data is also available at: http://data.remss.com/windsat/.
@markuslaker posting magnet links is helpful. We hope to aggregate the comments in these issues into a searchable/browsable index soon.
@axlecrusher @erikfriesen Are your mirrors public?
ftp://podaac-ftp.jpl.nasa.gov/allData/windsat/