internetarchive / openlibrary

One webpage for every book ever published!
https://openlibrary.org
GNU Affero General Public License v3.0
5.59k stars 1.53k forks source link

Document Data dumps #100

Closed anandology closed 5 years ago

anandology commented 13 years ago

Document the datadumps at http://openlibrary.org/developers/dumps.

It should have one section for each dump that is created with the format and explanation of each of the field present in each record.

bencomp commented 13 years ago

I put up a list of fields I found in January's datadump, combined with information from openlibrary.org/type on https://github.com/bencomp/openlibrary/blob/master/datadump-doc.txt Perhaps it's useful as a checklist.

bencomp commented 12 years ago

Could you add http://openlibrary.org/data/ol_dump_deworks_latest.txt.gz to the page as well? And document its contents :)

bencomp commented 12 years ago

Is it useful to add an explanation on how to download the dumps using BitTorrent?

sbshah97 commented 6 years ago

@mekarpeles looks like a Documentation Issue to me? Could we add this for Hacktoberfest?

mekarpeles commented 5 years ago

I think this can be closed; we're not hearing a lot of confusion about our data dumps. Unless @bencomp you have any interest in helping on this one-off :stuck_out_tongue:

bencomp commented 5 years ago

@mekarpeles It's been a few years since I last looked at the data dumps, and I don't see that I would have time anytime soon to help on this. But the core of the issue has not been addressed at all. Maybe it's true that noone is actually confused, although others may be scared away when they see that a complete lack of documentation of the data schema?