arthurpsmith / author-disambiguator

Wikidata service to help create or link author items to published articles
GNU General Public License v3.0
33 stars 8 forks source link

Adapt config for a wikibase different to wikidata? #154

Open dlindem opened 3 years ago

dlindem commented 3 years ago

Hello, I would like to use this tool on a wikibase instance different to wikidata. Is there an easy way to adapt config to that other wikibase (base url, author literal prop, author item prop, etc. etc.)? Many thanks in advance, David (http://data.lexbib.org)

arthurpsmith commented 3 years ago

Hi @dlindem - great question. Unfortunately as I've added features to the code I think it's become more and more dependent on the wikimedia ecosystem and details of wikidata... For your use case it definitely would be good to abstract those things better. Here are some of the places it would need to be done:

If you are interested in working on this yourself, let me know. Pull requests to address some of the above are welcome! I can also see this being something I might want to look into as well, having contemplated setting up a separate wikibase myself. Is your wikibase available to the public, so I could test with it?

dlindem commented 3 years ago

Hi @arthurpsmith, many thanks for your quick answer. Yes, my wikibase is available, look at the SPARQL query examples for getting an idea of the content, or e.g. this for a bibitem. I am doing bibdata migration from Zotero to my wikibase (and ultimately also to wikidata, but for curation I will keep it in mine). I am about to replace a batch of author/editor literals (P39/P42) with item statements (I used OpenRefine to cluster name variants to around 4.000 unique authors, which are just now created as new Q5 items (yes, humans are Q5 also in mine)). After that, your tool would be best option when including more bibitems, 95% of which will have known authors/editors. If you want to use my usecase for tests, perfect. I understand your hints, and will now create all necessary properties you list. As soon as I find more time, I would try to make the necessary edits to the authordisambiguator code, but I'd also have to find out how to install the tool (locally? on lexbib.org server? Elsewhere? Very little experience with such tasks).