Open glenrobson opened 3 years ago
It looks like we can get access to the records using the search api:
(Note you have to register for an API key to get access to these two interfaces.) It's a Solr-like JSON response and you can use the id
to get to the EDM record:
"id": "/232/https___digitalcollections_jtsa_edu_islandora_object_jts_3A18709_datastream_TN_view_Portrait_20of_20Ishmael_20Aga__jpg",
Which maps to:
and in there you can get the IIIF image URL from /object/aggregations/webResources/svcsHasService
and the manifest from /object/aggregations/webResources/dctermsIsReferencedBy. Note we will have to check that the manifest really is a manifest, as the example above is a IIIF image URL... Note also that there are multiple webResources.
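A rough sketch of how that lookup and the manifest check could work in Python. It assumes the record JSON has already been fetched and parsed, that `aggregations` and `webResources` are lists, and that the two properties may be either a string or a list. The function names are mine, not part of any Europeana client. The manifest check relies on IIIF Presentation 2 manifests declaring `"@type": "sc:Manifest"` and Presentation 3 manifests declaring `"type": "Manifest"`.

```python
from typing import Any, Iterator, Tuple


def _as_list(value: Any) -> list:
    """Normalise a missing, scalar, or list value to a list."""
    if value is None:
        return []
    return value if isinstance(value, list) else [value]


def iter_iiif_links(record: dict) -> Iterator[Tuple[str, str]]:
    """Yield (kind, url) pairs for every webResource in a parsed record.

    Walks /object/aggregations/webResources and reads the two
    properties mentioned above. There can be several webResources,
    so every one of them is visited.
    """
    obj = record.get("object", {})
    for aggregation in _as_list(obj.get("aggregations")):
        for web_resource in _as_list(aggregation.get("webResources")):
            for url in _as_list(web_resource.get("svcsHasService")):
                yield ("image_service", url)
            for url in _as_list(web_resource.get("dctermsIsReferencedBy")):
                yield ("referenced_by", url)


def is_manifest(doc: dict) -> bool:
    """Return True if a fetched JSON document declares itself a IIIF manifest."""
    # Presentation API 2: "@type": "sc:Manifest"; Presentation API 3: "type": "Manifest".
    return doc.get("@type") == "sc:Manifest" or doc.get("type") == "Manifest"
```

Each `referenced_by` URL would still need to be fetched and passed through `is_manifest` before being treated as a manifest, since some of them point at an image service instead.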
Stats from the photograph collection:
Total of 209,236 records
Found 7 - country
Found 47 - dataProvider
Found 8 - provider
Found 14 - rights
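For reference, counts like these can be produced by collecting the distinct values of each field across the search results. This is only a sketch of the idea, not the script actually used: it assumes each result item is a dict where `country`, `dataProvider`, `provider` and `rights` hold either a string or a list of strings, and the function names are made up for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Set

# Fields reported in the stats above.
FIELDS = ("country", "dataProvider", "provider", "rights")


def distinct_values(items: Iterable[dict]) -> Dict[str, Set[str]]:
    """Collect the set of distinct values seen for each field."""
    seen = defaultdict(set)
    for item in items:
        for field in FIELDS:
            value = item.get(field)
            if value is None:
                continue
            # Accept both a single string and a list of strings.
            values = value if isinstance(value, list) else [value]
            seen[field].update(values)
    return {field: seen[field] for field in FIELDS}


def format_stats(total: int, values: Dict[str, Set[str]]) -> List[str]:
    """Render the counts in the same 'Found N - field' layout."""
    lines = [f"Total of {total:,} records"]
    lines += [f"Found {len(values[field])} - {field}" for field in FIELDS]
    return lines
```

Calling `format_stats(len(items), distinct_values(items))` over the harvested items gives one line per field.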
So after the presentation at the IIIF conference Antoine pointed out that we could have got the data from here: https://pro.europeana.eu/page/harvesting-and-downloads#downloads which would have been a lot quicker!
🤯
For the record, my colleague @Hobbesball has pointed out that in January the dumps may not have been available as they are now. So it was rather an issue of unlucky timing, no regrets to have!
First experiment is not going well... Europeana have a SPARQL endpoint and I should be able to run the following SPARQL to retrieve all IIIF images and manifests that contain "Pho" in the dc:type:
Unfortunately this only returns two results. Both contain Photographie in the dc:type, but the search interface shows more.
It could be that the RDF database hasn't been updated since 2017...