POLDER-Crew / polder-federated-search

A federated search project for POLDER.
BSD 3-Clause "New" or "Revised" License

API-based indexing for Gleaner #183

Closed: yemoski closed this issue 1 year ago

yemoski commented 1 year ago

DataStream has an API that returns JSON-LD, given an API key in the request header.
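A rough sketch of what a header-authenticated fetch could look like, using only the Python standard library. The header name `x-api-key` and the URL passed in are placeholders, not DataStream's documented values; check their API docs for the real ones. The `opener` parameter is a hypothetical hook so the function can be exercised without a network call.

```python
import json
import urllib.request


def build_request(url, api_key, header_name="x-api-key"):
    """Build a GET request that carries the API key in a request header.

    `header_name` is an assumed placeholder; the real header name comes
    from the provider's documentation.
    """
    return urllib.request.Request(
        url,
        headers={header_name: api_key, "Accept": "application/ld+json"},
    )


def fetch_jsonld(request, opener=urllib.request.urlopen):
    """Send the request and decode the response body as JSON(-LD)."""
    with opener(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

Keeping request construction separate from the network call means the header logic can be checked on its own.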

CCADI has a paged JSON-LD API.
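For a paged API, the harvesting loop might look something like the sketch below. It assumes pages are numbered from 1 and that an empty page marks the end; CCADI's actual paging scheme may differ. `fetch_page` is a hypothetical callable that returns the list of JSON-LD records for a given page number.

```python
def iter_paged(fetch_page, start=1, max_pages=1000):
    """Yield records from successive pages until an empty page is returned.

    `fetch_page(page_number)` is a caller-supplied function (an assumption
    of this sketch) that returns a list of records for that page.
    `max_pages` is a safety cap so a misbehaving API cannot loop forever.
    """
    page = start
    for _ in range(max_pages):
        records = fetch_page(page)
        if not records:
            return
        yield from records
        page += 1
```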

On https://api.datacite.org/graphql, you can do things like

    {
      organization(id: "ror.org/01rhff309") {
        datasets {
          totalCount
          nodes {
            id
            titles { title }
            relatedIdentifiers { relatedIdentifier relationType }
            fundingReferences { awardTitle awardNumber }
            descriptions { description }
            creators { name familyName }
            geolocations {
              geolocationBox {
                eastBoundLongitude
                westBoundLongitude
                southBoundLatitude
                northBoundLatitude
              }
              geolocationPlace
              geolocationPoint { pointLatitude pointLongitude }
            }
          }
          pageInfo { endCursor hasNextPage }
        }
      }
    }

and get something that's shaped sort of like what we want.
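Since the query asks for `pageInfo { endCursor hasNextPage }`, a harvester would presumably follow the cursor until `hasNextPage` is false. Here is a minimal sketch of that loop. It assumes `datasets` accepts an `after` cursor argument, which is the usual GraphQL connection convention but should be confirmed against the DataCite schema. The query is trimmed to a few fields, and `post` is a hypothetical injection point for testing.

```python
import json
import urllib.request

DATACITE_GRAPHQL = "https://api.datacite.org/graphql"

# Trimmed version of the query above; the `after` argument is an assumption.
QUERY = """
query ($after: String) {
  organization(id: "ror.org/01rhff309") {
    datasets(after: $after) {
      totalCount
      nodes { id titles { title } }
      pageInfo { endCursor hasNextPage }
    }
  }
}
"""


def post_graphql(query, variables, url=DATACITE_GRAPHQL):
    """POST a GraphQL query with variables and return the decoded JSON."""
    body = json.dumps({"query": query, "variables": variables}).encode("utf-8")
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


def iter_datasets(post=post_graphql):
    """Yield dataset nodes, following endCursor until hasNextPage is false."""
    after = None
    while True:
        result = post(QUERY, {"after": after})
        datasets = result["data"]["organization"]["datasets"]
        yield from datasets["nodes"]
        page_info = datasets["pageInfo"]
        if not page_info["hasNextPage"]:
            return
        after = page_info["endCursor"]
```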

Given the right changes to Gleaner, the JSON-LD that these APIs return could be loaded directly into the triplestore.
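To make "loaded directly" a bit more concrete: one possible route is the SPARQL 1.1 Graph Store HTTP Protocol, where a POST to `<endpoint>?graph=<iri>` merges the payload into that named graph. This is only a sketch, not how Gleaner currently works. The endpoint URL and graph IRI are placeholders, and it assumes the triplestore accepts `application/ld+json` on that endpoint, which not every store does.

```python
import json
import urllib.parse
import urllib.request


def build_graph_store_post(endpoint, graph_iri, jsonld_doc):
    """Build a Graph Store Protocol POST that adds a JSON-LD document
    to the named graph `graph_iri`.

    Assumes the store at `endpoint` (a placeholder) accepts JSON-LD.
    """
    query = urllib.parse.urlencode({"graph": graph_iri})
    body = json.dumps(jsonld_doc).encode("utf-8")
    return urllib.request.Request(
        f"{endpoint}?{query}",
        data=body,
        headers={"Content-Type": "application/ld+json"},
        method="POST",
    )
```

The returned request can be sent with `urllib.request.urlopen(request)` once the endpoint and content type have been confirmed for the store in use.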

yemoski commented 1 year ago

CIOOS uses a CKAN API. It'd be pretty neat to support CKAN, since it's a widely adopted technology.
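For reference, walking a CKAN catalogue through its Action API could look roughly like this. `package_search` with `q`, `start`, and `rows` is part of the standard CKAN Action API, and the response nests records under `result.results` with a total in `result.count`. The base URL is a placeholder, and `fetch` is a hypothetical hook for testing. Note that this yields CKAN package metadata, not JSON-LD, so a mapping step would still be needed.

```python
import json
import urllib.parse
import urllib.request


def package_search_url(base_url, start=0, rows=100, q="*:*"):
    """Build a CKAN Action API package_search URL for one page of results."""
    params = urllib.parse.urlencode({"q": q, "start": start, "rows": rows})
    return f"{base_url.rstrip('/')}/api/3/action/package_search?{params}"


def fetch_json(url):
    """GET a URL and decode the JSON response body."""
    with urllib.request.urlopen(url) as response:
        return json.loads(response.read().decode("utf-8"))


def iter_packages(base_url, fetch=fetch_json, rows=100):
    """Yield every package record, paging with start/rows until count is reached."""
    start = 0
    while True:
        result = fetch(package_search_url(base_url, start, rows))["result"]
        yield from result["results"]
        start += rows
        # Stop when we've walked past the reported total or got an empty page.
        if start >= result["count"] or not result["results"]:
            return
```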

yemoski commented 1 year ago

This has been on their radar too, in a similar way! iodepo/odis-arch: Issue #140

nein09 commented 1 year ago

Just mass-closed issues that are tagged as completed, because that's what you do with completed issues! Thanks.