POLDER-Crew / polder-federated-search

A federated search project for POLDER.
BSD 3-Clause "New" or "Revised" License
5 stars 1 forks source link

Proof of concept: index CCADI #107

Closed nein09 closed 2 years ago

nein09 commented 2 years ago

https://trello.com/c/xxtod30P

An ideal solution for this would be something like I outline in https://trello.com/c/3hYsa6iE, where you could give an API endpoint template URL to Gleaner, and it would page through the API in an automated way and ingest the JSON-LD that it can get there.

But for now, we can index CCADI by asking Gleaner to crawl its API endpoint, by building a sitemap that pages through it.

I had to slightly modify the SPARQL query that we use in order to get CCADI search results to show up, but otherwise I could just plug it into our system, which says great things about the CCADI developers.