HeardLibrary / vandycite


Commons upload script test #29

Closed baskaufs closed 2 years ago

baskaufs commented 2 years ago

The test script for uploading Commons items is commonsbot.ipynb. Its current status is that it can upload individual items or a series of items using a loop. The loop uses the idiosyncratic data source upload_metadata.csv, which was purpose-built for uploading the Vandycite videos.
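The loop-over-a-CSV pattern described here can be sketched roughly as follows. This is not the code in commonsbot.ipynb: the column names (`local_filename`, `commons_title`), the `upload_all` helper, and the `fake_upload` callback are all hypothetical stand-ins, and the sample rows are invented so the sketch runs without the real upload_metadata.csv.

```python
import csv
import io
from typing import Callable, Dict, Iterable, List


def upload_all(rows: Iterable[Dict[str, str]],
               upload_one: Callable[[Dict[str, str]], bool]) -> List[str]:
    """Call upload_one for each metadata row; return the titles that failed."""
    failed = []
    for row in rows:
        # Hypothetical column name; the real CSV's headers may differ.
        title = row["commons_title"]
        try:
            ok = upload_one(row)
        except Exception as err:
            # Keep going so one bad row does not stop the whole batch.
            print(f"{title}: {err}")
            ok = False
        if not ok:
            failed.append(title)
    return failed


# Invented stand-in for upload_metadata.csv.
SAMPLE_CSV = """local_filename,commons_title
video1.webm,Example video 1.webm
video2.webm,Example video 2.webm
"""


def fake_upload(row: Dict[str, str]) -> bool:
    # Placeholder for the real upload call: pretend the second file fails.
    return row["local_filename"] != "video2.webm"


failed_titles = upload_all(csv.DictReader(io.StringIO(SAMPLE_CSV)), fake_upload)
```

Passing the upload function in as a parameter keeps the loop testable without touching Commons, and collecting failures instead of raising makes it easier to re-run only the rows that did not go through.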

There are several things that need to be improved in this script before it is ready to be used in production:

baskaufs commented 2 years ago

I found the example YAML file from Andrew. It's in this gist.

baskaufs commented 2 years ago

I found my notes in a Word doc (action-items.docx): Notes from the 2021-05-04 meeting on Structured Data in Commons (LD4 Wikidata meeting)

Call notes: https://docs.google.com/document/d/1Es7BP60JMv07jZGDmF4TJld7KxZu3cDYYQdzoM_jrRM/edit

Presenters:
- Alicia Fagerving, alicia.fagerving@wikimedia.se
- John Cummings, Wikimedian in Residence, UNESCO

Wikidata+Commons example https://meta.wikimedia.org/wiki/Structured_data_for_GLAM-Wiki/Roundtripping/KMB

Best practice is to put data for the images on Commons and for the things in the image in Wikidata.

Structured data lets you add data about a photo using properties from Wikidata:

- Depicts: the most important property in SDC
  - What is in the file
  - Links to Wikidata item
  - To see all depicts for an item, type in the search field: haswbstatement:P180=Q750444
- Creator
  - For Wikimedia users:
    - Object has role: photographer
    - Author name string:
    - Wikimedia username
    - URL:
- Inception
  - Date photo taken
- Coordinates
  - Fetched from EXIF data
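The same haswbstatement search can also be run through the MediaWiki search API rather than the search box. The sketch below only builds the request URL and does not send it; the `depicts_search_url` helper is a hypothetical name, and restricting the search to the File namespace (6) is an assumption about what is wanted here.

```python
from urllib.parse import urlencode

COMMONS_API = "https://commons.wikimedia.org/w/api.php"


def depicts_search_url(qid: str, limit: int = 10) -> str:
    """Build a search API URL for files whose depicts (P180) value is qid."""
    if not (qid.startswith("Q") and qid[1:].isdigit()):
        raise ValueError(f"not a Wikidata Q ID: {qid!r}")
    params = {
        "action": "query",
        "list": "search",
        "srsearch": f"haswbstatement:P180={qid}",
        "srnamespace": 6,  # assumption: only File: pages are of interest
        "srlimit": limit,
        "format": "json",
    }
    return COMMONS_API + "?" + urlencode(params)


url = depicts_search_url("Q750444")
```

The resulting URL can be fetched with any HTTP client; matching pages should appear under `query.search` in the JSON response.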

Search engines depend on depicts data, so important.

How to link an image of a work on the Commons side: use "depicts" for what's in the image, and use the "digital representation of" property to link to the Wikidata item for the work.
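As a rough illustration of that modeling, the two kinds of statements could be assembled as Wikibase-style JSON like this. The helper names are hypothetical, the artwork Q ID is a placeholder rather than a real artwork item, and the payload shape reflects my understanding of the Wikibase statement format, so it should be checked against the API documentation before being sent anywhere.

```python
import json
from typing import Dict, List


def item_statement(prop: str, qid: str) -> Dict:
    """Return a statement dict whose value is the Wikidata item qid."""
    return {
        "mainsnak": {
            "snaktype": "value",
            "property": prop,
            "datavalue": {
                "type": "wikibase-entityid",
                "value": {
                    "entity-type": "item",
                    "numeric-id": int(qid[1:]),
                    "id": qid,
                },
            },
        },
        "type": "statement",
        "rank": "normal",
    }


def artwork_image_claims(depicted_qids: List[str], artwork_qid: str) -> Dict:
    """One P180 (depicts) per depicted thing, plus one P6243 for the artwork."""
    claims = [item_statement("P180", q) for q in depicted_qids]
    claims.append(item_statement("P6243", artwork_qid))
    return {"claims": claims}


# Q750444 is the example from the notes; Q4115189 is only a placeholder here.
payload = json.dumps(artwork_image_claims(["Q750444"], "Q4115189"))
```

Keeping depicts and "digital representation of" as separate statements follows the notes: the first says what is visible in the file, the second says which Wikidata item the file reproduces.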

Structured data modeling discussion: https://commons.wikimedia.org/wiki/Commons:Structured_data/Modeling

baskaufs commented 2 years ago

At an LD4 meeting, structured data was discussed. Here are some notes I took: [image] [image]

That last URL leads to the Image Annotator tool: https://image-annotator.toolforge.org/

baskaufs commented 2 years ago

Discovered that there is a Commons category for Vanderbilt University Fine Arts Gallery. We should use it to tag the images when they are uploaded to Commons. Also, we should make sure that the images that are there (mostly Vanderbilt family portraits) are represented as Wikidata items that are linked to the Gallery.

baskaufs commented 2 years ago

For public domain works using the Artwork template, the license property doesn't need to be provided in Structured data. However, the file will get flagged if it doesn't have a P6243 (digital representation of) statement in Structured data linking it back to the Wikidata artwork item. I'm not sure whether that flag is still set when the Wikidata Q ID is given in the template, or whether the file always gets flagged without the statement.
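A minimal sketch of page wikitext that gives the Wikidata Q ID in the Artwork template and adds the gallery category might look like the following. The `artwork_page_wikitext` helper is a hypothetical name, the Q ID is a placeholder, and a real upload would need more template fields (and a public domain tag) than shown; this is not the wikitext the commonsbot script actually generates.

```python
from typing import List


def artwork_page_wikitext(artwork_qid: str, categories: List[str]) -> str:
    """Build bare-bones file page wikitext: Artwork template plus categories."""
    lines = [
        "{{Artwork",
        f" |wikidata = {artwork_qid}",  # Q ID of the artwork's Wikidata item
        "}}",
    ]
    lines += [f"[[Category:{name}]]" for name in categories]
    return "\n".join(lines)


# Placeholder Q ID; the category name is the one mentioned earlier in this thread.
text = artwork_page_wikitext(
    "Q4115189", ["Vanderbilt University Fine Arts Gallery"]
)
```

Whether giving the Q ID in the template is enough to avoid the flag is exactly the open question above, so the P6243 statement would presumably still be added separately.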

baskaufs commented 2 years ago

Incremental work to switch commonsbot script to use artwork template https://github.com/HeardLibrary/linked-data/commit/3a2e1916605ddd3d8dbbf15dc36897ab4f534312

baskaufs commented 2 years ago

Completed with https://github.com/HeardLibrary/linked-data/commit/ce34ff34438db5e0abc87dc769e646d8f8e75cbd