Closed anandology closed 5 years ago
I put up a list of fields I found in January's datadump, combined with information from openlibrary.org/type on https://github.com/bencomp/openlibrary/blob/master/datadump-doc.txt Perhaps it's useful as a checklist.
Could you add http://openlibrary.org/data/ol_dump_deworks_latest.txt.gz to the page as well? And document its contents :)
Is it useful to add an explanation on how to download the dumps using BitTorrent?
@mekarpeles looks like a Documentation Issue to me? Could we add this for Hacktoberfest?
I think this can be closed; we're not hearing a lot of confusion about our data dumps. Unless @bencomp you have any interest in helping on this one-off :stuck_out_tongue:
@mekarpeles It's been a few years since I last looked at the data dumps, and I don't see that I would have time anytime soon to help on this. But the core of the issue has not been addressed at all. Maybe it's true that noone is actually confused, although others may be scared away when they see that a complete lack of documentation of the data schema?
Document the datadumps at http://openlibrary.org/developers/dumps.
It should have one section for each dump that is created with the format and explanation of each of the field present in each record.