climate-mirror / datasets

For tracking data mirroring progress

EPA Air Quality Data archives #362

Open rsargent opened 7 years ago

rsargent commented 7 years ago

New Dataset

Please fill out the New Issue form so we can easily organise the issues and put a priority on certain data. The title should reflect the dataset you want us to save. In the body, please include the following information.

By adding the government agency and the division which collects the data, it will make it much easier to catalogue and prioritise!

Example:

JeremiahCurtis commented 7 years ago

Finished: 965 zip files, 12.7 GB (13,661,205,194 bytes).

HostileGranola commented 7 years ago

I'm currently downloading this using aria2c -i list.txt -d aqsdr1.epa.gov/aqsweb/aqstmp/airdata/ -x 10 -c. The list is a plaintext version of this file stripped down to the first column, with "http://aqsdr1.epa.gov/aqsweb/aqstmp/airdata/" prepended to each line.
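For anyone wanting to rebuild a list like this, here is a minimal Python sketch of the step described above. It assumes the source listing is whitespace-separated with the file name in the first column, which may not match the actual file. The function name build_url_list and the script and input file names in the usage comment are hypothetical.

```python
import sys

# Base URL quoted in the comment above.
BASE_URL = "http://aqsdr1.epa.gov/aqsweb/aqstmp/airdata/"


def build_url_list(listing_lines, base_url=BASE_URL):
    """Return one download URL per non-empty listing line.

    Assumption: columns are separated by whitespace and the first
    column holds the file name.
    """
    urls = []
    for line in listing_lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        urls.append(base_url + fields[0])
    return urls


if __name__ == "__main__" and len(sys.argv) > 1:
    # Hypothetical usage: python build_list.py listing.txt > list.txt
    with open(sys.argv[1], encoding="utf-8") as fh:
        for url in build_url_list(fh):
            print(url)
```

The resulting list.txt can then be passed to aria2c -i as in the command above.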

I will post hashes from hashdeep -erl and file sizes from du -b --max-depth=1 --human-readable once complete.
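If hashdeep is not available, a rough equivalent can be sketched in Python with the standard library. This is only an illustration: it does not reproduce hashdeep's output format, and the helper names file_digests and manifest are made up for this example.

```python
import hashlib
from pathlib import Path


def file_digests(path, chunk_size=1 << 20):
    """Return (md5_hex, sha256_hex) for one file, read in chunks."""
    md5 = hashlib.md5()
    sha256 = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            md5.update(chunk)
            sha256.update(chunk)
    return md5.hexdigest(), sha256.hexdigest()


def manifest(root):
    """Return (size_bytes, md5, sha256, relative_path) for every file under root."""
    root = Path(root)
    rows = []
    for path in sorted(p for p in root.rglob("*") if p.is_file()):
        md5_hex, sha_hex = file_digests(path)
        rel = path.relative_to(root).as_posix()
        rows.append((path.stat().st_size, md5_hex, sha_hex, rel))
    return rows


def total_size(rows):
    """Sum of file sizes in bytes, comparable to a byte count from du -b."""
    return sum(row[0] for row in rows)
```

Printing the rows one per line gives a simple manifest that others can compare against their own copy.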

HostileGranola commented 7 years ago

I have a copy of this dataset as of 2017-03-28 02:40:00 UTC

Hash results are here

Size results are here

HostileGranola commented 7 years ago

@JeremiahCurtis I notice our downloads are different sizes. Could you compare your results with mine? I can make my list file available if that would be of use to you.
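A comparison like this could be done by diffing two manifests. The sketch below assumes each copy has been reduced to a dict mapping file name to a (size, hash) tuple; that structure and the name compare_manifests are assumptions for illustration, not something either downloader published.

```python
def compare_manifests(mine, theirs):
    """Compare two {file_name: (size_bytes, hash_hex)} dicts.

    Returns three sorted lists: names only in `mine`, names only in
    `theirs`, and names present in both whose size or hash differs.
    """
    mine_names = set(mine)
    their_names = set(theirs)
    only_mine = sorted(mine_names - their_names)
    only_theirs = sorted(their_names - mine_names)
    differing = sorted(
        name for name in mine_names & their_names if mine[name] != theirs[name]
    )
    return only_mine, only_theirs, differing


if __name__ == "__main__":
    # Made-up entries, only to show the shape of the input and output.
    a = {"x.zip": (10, "aa"), "y.zip": (20, "bb")}
    b = {"y.zip": (21, "bc"), "z.zip": (5, "cc")}
    only_a, only_b, changed = compare_manifests(a, b)
    print("only in first:", only_a)
    print("only in second:", only_b)
    print("differing:", changed)
```

Missing files or files that changed upstream between the two download dates would both show up here, which helps tell the two causes apart.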

Juerd commented 7 years ago

http://[2a03:b0c0:2:d0::1dae:1001]/EPA_Air_Quality_Data/
http://188.166.4.6/EPA_Air_Quality_Data/

Will stay online until the 1 TB transfer limit is reached (this should allow for 75+ more full mirrors), or a month has passed, whichever comes first.

as-com commented 7 years ago

Second mirror: https://mirrors.asun.co/climate-mirror/aqsdr1.epa.gov/aqsweb/aqstmp/airdata/
hashdeep: https://mirrors.asun.co/climate-mirror/aqsdr1.epa.gov/aqsweb/aqstmp/airdata/hashdeep.txt

CorentinB commented 7 years ago

I'm currently downloading it using lftp mirror.

x775 commented 7 years ago

I have a complete copy as of this posting.

md5: 368914f7fbc4f5d93771ae28783a02ff
sha256: 7db788976a63088f2aa6f57b9ccefa736e819fc2fa510e39bd33a6bded16275a

Individual checksums: https://gist.github.com/x775/ddfbd273faeec0a610a8e4c297fca4c9
Direct download links: https://gist.github.com/x775/182339a9649cda0ded5d47c8276c17af

Size: 12.71862 GB

Compressed name: EPA_Air_Quality_Data_Archives.7z
Compressed md5: bf7445c9ada22a92aaca4dc3f737be2e
Compressed sha256: 94ab392f322c4c156349d4af86d50d5272d420716f32990b40a5ea8d50ca01b8
Compressed size: 9.04773 GB
Compressed download link: https://drive.google.com/open?id=0B6PlQrUTwL1PU1FEQWdFMms1d0k

entr0p1 commented 7 years ago

Currently downloading

entr0p1 commented 7 years ago

All done

Name: aqsdr1.epa.gov
Checksum(s): https://gateway.ipfs.io/ipfs/QmQQXfNAFNgvsCdLSJDxVv8e5xdU3WHUTX2F7u66Hg6oJt
Root Directory: https://gateway.ipfs.io/ipfs/QmSrcwafapwbK27sBEsWs5vuefbRgh4xk7FQXecrqRppyi
Total Size: 12.7 GB

Edit: goofed up the checksum link - my bad!

DanTheMan827 commented 7 years ago

It's not a mirror per se, but here's a torrent with web seeds pointing to all current compatible mirrors.

This should make things easier for making new mirrors.

https://gist.github.com/DanTheMan827/16eaf6d45f7fa443d7bacfabdd0060dc/raw/5da72fdc2f9a9bb63ba2f71e03212b77c91b7eed/EPA%2520Air%2520Quality%2520Data%2520archives%2520%2523362%2520-%25202017-05-11.torrent

Koxzi95 commented 6 years ago

Here is a temporary mirror whilst I get a more robust one set up:
ftp://13.81.107.248/epa-air-quality-historical/

HostileGranola commented 4 years ago

I no longer have this dataset due to storage space constraints. Apologies.