Closed baskaufs closed 2 years ago
I found my notes in a Word doc (action-items.docx): Notes from the 2021-05-04 meeting on Structured Data in Commons (LD4 Wikidata meeting)
Call notes: https://docs.google.com/document/d/1Es7BP60JMv07jZGDmF4TJld7KxZu3cDYYQdzoM_jrRM/edit
Presenters: Alicia Fagerving alicia.fagerving@wikimedia.se John Cummings Wikimedian in Residence, UNESCO
Wikidata+Commons example https://meta.wikimedia.org/wiki/Structured_data_for_GLAM-Wiki/Roundtripping/KMB
Best practice is to put data for the images on Commons and for the things in the image in Wikidata.
o Structured data lets you add data about photo using properties from Wikidata Depicts--most important property in SDC • What is in the file • Links to Wikidata item • To see all depicts for an item: Type in search field: o haswbstatement:P180=Q750444 Creator • For Wikimedia users: o Object has role: photographer o Author name string: o Wikimedia username o URL: Inception • Date photo taken Coordinates • Fetched from exif data
Search engines depend on depicts data, so important.
How to link image of work on Commons side--use "depicts" for what’s in image and use "digital representation of" property to link to Wikidata item
Structured data modeling discussion: https://commons.wikimedia.org/wiki/Commons:Structured_data/Modeling
At an LD4 meeting, structured data was discussed. Here are some notes I took:
That last URL leads to the Image Annotator tool: https://image-annotator.toolforge.org/
Discovered that there is a Commons category for Vanderbilt University Fine Arts Gallery. We should use it to tag the images when they are uploaded to Commons. Also, we should make sure that the images that are there (mostly Vanderbilt family portraits) are represented as Wikidata items that are linked to the Gallery.
For public domain works using the Artwork template, the license property doesn't need to be provided in Structured data. However, it will get flagged if it doesn't have a P6243 (digital representation of) statement in Structured data linking the work back to the Wikidata artwork item. Not sure if that flag gets set when the Wikidata Q ID is given in the template or if it always gets flagged.
Incremental work to switch commonsbot script to use artwork template https://github.com/HeardLibrary/linked-data/commit/3a2e1916605ddd3d8dbbf15dc36897ab4f534312
The test script for uploading Commons items is commonsbot.ipynb. It's current status is that it can upload individual items or a series of items using a loop. The loop uses the idiosyncratic data source upload_metadata.csv, which was purpose-built for uploading the Vandycite videos.
There are several things that need to be improved in this script before it is ready to be used in production:
create_commons_claim()
, creates individual claims one at a time. If there are going to be more than one or two claims added per image, this should be changed from using thewbcreateclaim
method to usingwbeditentity
that VanderBot uses. That method also allows the image Wikibase label (the "image caption") to be set at the same time instead of using thewbsetlabel
method used in theset_commons_image_caption()
function.