sul-dlss / FOLIO-Project-Stanford

Task management for Stanford’s analysis of FOLIO.
2 stars 0 forks source link

Determine strategy for getting item/holdings data #579

Closed ahafele closed 2 months ago

ahafele commented 10 months ago

For POD and ShareVDE we need to include holdings and items data.

Options

Data Export: allows you to build profiles that can include holdings and items data in mapped MARC fields. However, you can not use data export profiles to exclude suppressed records. We could maybe export the bibliographic, holdings, items data separately with different metadb reports and link together with record IDs. This could work for ShareVDE, need to check on POD

OAI/PMH: Allows for harvesting of bib, holdings, and items data but https://wiki.folio.org/display/FOLIOtips/OAI-PMH+Settings indicates that time is the finest selection criteria available. This would not work for our selection criteria.

SearchWorks method: Direct database queries. Selection criteria would also be the same. Jeremy was going to investigate this option. Separate ticket #582

Okapi calls?

shelleydoljack commented 10 months ago

I think OAI-PMH to get all and then incrementals going forward, and then having separate scripts or processes to parse those files for the various data export criteria is one strategy to explore. I'm not sure the direct db query we are doing with SW is going to last.

ahafele commented 9 months ago

Update from Share-VDE - can use marcxml files. About the possibility to export data via OAI-PMH: we are using this protocol, for example to manage data coming frm POD (where also Stanford puts its data, if I’m not wrong).

ahafele commented 7 months ago

Current thinking is that data will be retrieved directly from metadb. Close this ticket if others have been created.

jgreben commented 2 months ago

Using SQL and API as needed in airflow app.