UCSCLibrary / ucsc-library-digital-collections

A rails app based on Hyrax to be used as a repository for UCSC library digital collections.

Inquiry: Can Bulkrax be made to export labels for controlled terms ingested as URIs? #699

Open rmjaffe opened 8 months ago

rmjaffe commented 8 months ago

Descriptive summary

Controlled values (DCMI, LC vocabularies, Getty vocabularies, Geonames) have been entered on and ingested from spreadsheets as URIs. The DAMS does the work of using those URIs to pull and display the appropriate labels. When exporting metadata, Bulkrax by default exports controlled values as URIs, just as they were entered. Is it possible, and would it be feasible, to use Bulkrax to export labels for controlled values in addition to the URIs?

The request here is to do a preliminary investigation: Is it possible to export labels for controlled terms? If so, what work would be required to make this work?

Background

If we migrate our digital collections content to Quartex, that system has no linked data functionality. For controlled terms in our metadata to display meaningfully to users, we would need to ingest the labels for all the terms -- not URIs. At present, it is not possible to fetch labels for controlled URIs using OpenRefine. The Getty endpoint doesn't allow for fetching; the LC dump that Ned set up is not configured for fetching labels either.

Acceptance criteria

Done means:

rmjaffe commented 7 months ago

@snehagunduraoUL I updated this ticket -- on export, I'm going to need both the URIs and labels.

snehagunduraoUL commented 6 months ago

This can be done by using the database table (ld_buffers) where the URLs and labels are saved. A Python script to append the labels to the URLs in the CSV is ready.
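The script itself is not attached to this thread. As a rough illustration of the approach only, here is a minimal sketch that assumes the URL-to-label pairs from ld_buffers have already been loaded into a Python dict (the table's column names are not given here, so the lookup query is left out). The function names, the `|` multi-value separator, the `_label` column suffix, and the example.org URIs are all hypothetical.

```python
import csv
import io

# Assumption: multiple values in one cell are separated by "|".
# Adjust this to match the separator used in the actual export.
DELIMITER = "|"


def append_labels(rows, labels, columns, delimiter=DELIMITER):
    """Return copies of rows with a '<column>_label' field per controlled column.

    rows    -- list of dicts, one per CSV row
    labels  -- dict mapping URI -> label (assumed to come from ld_buffers)
    columns -- names of the columns that hold controlled-term URIs
    """
    out = []
    for row in rows:
        new_row = dict(row)  # the original URI columns are left untouched
        for col in columns:
            cell = row.get(col) or ""
            uris = [u.strip() for u in cell.split(delimiter) if u.strip()]
            # Fall back to the URI when no label is known, so gaps stay visible.
            new_row[f"{col}_label"] = delimiter.join(labels.get(u, u) for u in uris)
        out.append(new_row)
    return out


def process_csv(in_text, labels, columns):
    """Read CSV text, add label columns, and return the new CSV text."""
    reader = csv.DictReader(io.StringIO(in_text))
    rows = append_labels(list(reader), labels, columns)
    fieldnames = list(reader.fieldnames or []) + [f"{c}_label" for c in columns]
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


if __name__ == "__main__":
    # Made-up data for illustration only.
    demo_labels = {"http://example.org/term/1": "Term one"}
    demo_csv = "title,subject\nA,http://example.org/term/1\n"
    print(process_csv(demo_csv, demo_labels, ["subject"]))
```

Writing the labels to separate columns rather than overwriting the URIs keeps both in the output, which matches the request above for both URIs and labels on export.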

rmjaffe commented 6 months ago

@snehagunduraoUL This is exciting. Let's talk more in our meeting on Wednesday.