ipfs-inactive / archives

[ARCHIVED] Repo to coordinate archival efforts with IPFS
https://awesome.ipfs.io/datasets

Stackexchange Archives #50

Open dignifiedquire opened 8 years ago

dignifiedquire commented 8 years ago

These are published here https://archive.org/download/stackexchange

I've downloaded them to my machine and added them to ipfs 0.4. Currently they are being pinned by biham.

The folder hash is QmYgHvTrSfPJH5Dswq6NB8wTHH77BFaJdLP8UBYJz9Wz19 with the nested parts being listed here

davidar commented 8 years ago

Awesome, thanks @Dignifiedquire :)

dignifiedquire commented 8 years ago

http://v04x.ipfs.io/ipfs/QmYgHvTrSfPJH5Dswq6NB8wTHH77BFaJdLP8UBYJz9Wz19 is showing the listing :) Official pinning on biham is still running.

RichardLitt commented 8 years ago

Is this done, then? What remains?

dignifiedquire commented 8 years ago

Still not finished pinning it onto biham :(

eminence commented 8 years ago

At a minimum, we should include a datapackage.json file, as described in https://github.com/ipfs/archives/issues/45
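As a rough illustration of what such a file might contain, here is a minimal sketch that generates one. The field names loosely follow the Frictionless Data Package conventions and are only an illustrative subset; the exact fields asked for in issue #45 are not shown in this thread, and `ipfs_hash` is a hypothetical custom key, not part of any spec.

```python
import json


def build_datapackage(name, title, source_url, ipfs_hash):
    """Return a dict shaped like a small datapackage.json descriptor."""
    return {
        "name": name,
        "title": title,
        # Where the original data was obtained from.
        "sources": [{"title": "Internet Archive", "path": source_url}],
        # Hypothetical custom field recording the IPFS folder hash.
        "ipfs_hash": ipfs_hash,
    }


descriptor = build_datapackage(
    name="stackexchange",
    title="Stack Exchange data dump",
    source_url="https://archive.org/download/stackexchange",
    ipfs_hash="QmYgHvTrSfPJH5Dswq6NB8wTHH77BFaJdLP8UBYJz9Wz19",
)

# Serialize to the text that would be saved as datapackage.json.
datapackage_json = json.dumps(descriptor, indent=2)
print(datapackage_json)
```

The resulting text could be saved as `datapackage.json` at the root of the folder before it is added to IPFS.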

Also, if possible, the script used to create this archive should be included, so that others can help keep it up-to-date.

Finally, it would be super-awesome if we also had some way to interface with this data. Having an archive of this stuff is important, but in its current form it is of limited use to users (both IPFS users and Stack Exchange users).

dignifiedquire commented 8 years ago

> Also, if possible, the script used to create this archive should be included, so that others can help keep it up-to-date.

No script, just manual labour.

> Finally, it would be super-awesome if we also had some way to interface with this data. Having an archive of this stuff is important, but in its current form it is of limited use to users (both IPFS users and Stack Exchange users).

It would be, but I'm not sure how, as these are mostly gigantic XML files.
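One possible starting point for very large XML files is streaming them instead of loading them whole. The sketch below uses Python's standard `xml.etree.ElementTree.iterparse`. It assumes the files have already been extracted from the downloaded archives and that each record is a `<row ... />` element with its data in attributes; that layout, the `iter_rows` helper, and the sample data are assumptions for illustration, not something confirmed in this thread.

```python
import io
import xml.etree.ElementTree as ET


def iter_rows(source, tag="row"):
    """Yield the attributes of each <tag> element without loading the whole file."""
    context = ET.iterparse(source, events=("start", "end"))
    # The first event is the start of the root element; keep a handle on it.
    _, root = next(context)
    for event, elem in context:
        if event == "end" and elem.tag == tag:
            yield dict(elem.attrib)
            # Drop already-processed children so memory use stays roughly flat.
            root.clear()


# Tiny in-memory stand-in for a multi-gigabyte file (assumed layout).
sample = io.BytesIO(
    b'<posts><row Id="1" Title="Hello" /><row Id="2" Title="World" /></posts>'
)
rows = list(iter_rows(sample))
for row in rows:
    print(row["Id"], row["Title"])
```

For a real file, `source` would be a path or an open binary file object. The rows could then be fed into a database or search index, which is one way an interface to the data might be built.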