internetarchive / openlibrary-client

Python Client Library for the Archive.org OpenLibrary API
GNU Affero General Public License v3.0
371 stars 81 forks source link

Bot Opportunies #165

Open xayhewalo opened 4 years ago

xayhewalo commented 4 years ago

Preface

The old Writing Bots Wiki used to have a list of "Bot Opportunities" that have been migrated here. The list migrated from the wiki will likely need further breakdown.

Contributors can edit this issue to track all bot-related clean up

Legend

Bot Clean up Opportunities

BrittanyBunk commented 4 years ago

@guyjeangilles I haven't created bots before (I'll look into it more), but I wish there's some description of how bots are created (like what databases they use - do they use the bulk data dump or the API? Idk how to access the API - as the website doesn't say).

BrittanyBunk commented 4 years ago

I think this relates to internetarchive/openlibrary#2902, because without data to be able to be easy to work with - it's going to be difficult to parse through to figure out that bots are needed and how to create bots to work with the data. Bots may also be able to work faster with less data too!

xayhewalo commented 4 years ago

@BrittanyBunk The API documentation could be better but you can most of it here: https://openlibrary.org/developers/api

Also the Bot Wiki is a little outdated but it's a good place to start. @hornc is a good resource as well.

BrittanyBunk commented 4 years ago

@guyjeangilles If that Bot Wiki could be added to the API documentation page on the OL, it would be an enormous help. I think that's the missing connection for explaining how to make bots! It's also nice that there's a new 'read' section too.

BrittanyBunk commented 4 years ago

I am going to list my bot opportunity ideas to keep in place here.

LeadSongDog commented 2 years ago

Another one: many author records have a trailing space in the name. These frequently are duplicates of records for the same author sans space. They should be corrected and, where appropriate, merged. See for example https://openlibrary.org/works/OL17769107W