sul-dlss / purl

URL resolver that translates a reference to a digital object in the form of a druid, into a full content representation of that object as available

Metadata mappings for CORE OAI-PMH #866

Open amyehodge opened 9 months ago

amyehodge commented 9 months ago

Create metadata mappings so that we can provide access to the ETD materials (those submitted via the ETD application, not via H2 or other means) and items in the Stanford University Open Access Articles collection (druid:nk071jz2236) via OAI-PMH to the CORE service (http://core.ac.uk).

See Best practice for CORE harvesting of data providers-v3, particularly section 3.2 on metadata configuration.

amyehodge commented 9 months ago

Proposal for this work can be found in #795.

arcadiafalcone commented 9 months ago

@amyehodge Is there a desire to expand CORE harvesting to non-ETD items in future?

amyehodge commented 9 months ago

Yes. Definitely to the open access and other research publications, but potentially to other text content like non-ETD theses and capstones, grey literature, technical reports, etc. But I'd say OA publications would be top of that expanded list.

arcadiafalcone commented 9 months ago

Recommended CORE meta-tags:

Title and author are already extracted for schema.org tags. Publication date will always be present in the descriptive metadata (an ETD record should include only two dates, publication and copyright, so the correct date may be easily identified). Publisher would be Stanford University for all objects.
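A minimal sketch of how those four values could be rendered as HTML meta tags. The `citation_*` tag names are an assumption here, modeled on the Google Scholar-style convention and not confirmed anywhere in this thread; the function name and its arguments are hypothetical as well.

```python
from html import escape

# Assumption: Google Scholar-style citation_* tag names. The thread does not
# list the exact tags CORE recommends, so treat these names as placeholders.
PUBLISHER = "Stanford University"  # stated above as the publisher for all objects


def core_meta_tags(title, authors, publication_date, publisher=PUBLISHER):
    """Return HTML <meta> tags for one record, one tag per line."""
    pairs = [("citation_title", title)]
    # One tag per author, in the order given.
    pairs += [("citation_author", author) for author in authors]
    pairs.append(("citation_publication_date", publication_date))
    pairs.append(("citation_publisher", publisher))
    return "\n".join(
        f'<meta name="{name}" content="{escape(value, quote=True)}">'
        for name, value in pairs
    )


if __name__ == "__main__":
    print(core_meta_tags("An Example Thesis", ["Doe, Jane"], "2020"))
```

Values are HTML-escaped so that titles containing `&` or quotation marks do not break the attribute.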

arcadiafalcone commented 9 months ago

CORE metadata schemas:

If the focus is on ETDs (metadata automatically generated from a template) and open-access articles deposited via H2 (user-created metadata restricted by the interface), only a relatively small and constant set of metadata fields needs to be mapped.
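To illustrate how small such a mapping could be, here is a sketch that turns a flat record into an `oai_dc` element. Dublin Core is used only because it is the baseline format every OAI-PMH provider must support, not because a schema has been chosen. The source field names (`title`, `authors`, `publication_date`, `publisher`, `purl`) are hypothetical and do not reflect the actual descriptive metadata model.

```python
import xml.etree.ElementTree as ET

OAI_DC = "http://www.openarchives.org/OAI/2.0/oai_dc/"
DC = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("oai_dc", OAI_DC)
ET.register_namespace("dc", DC)

# Hypothetical source field -> Dublin Core element. The left-hand names are
# assumptions made for this sketch, not fields confirmed by the thread.
FIELD_MAP = [
    ("title", "title"),
    ("authors", "creator"),
    ("publication_date", "date"),
    ("publisher", "publisher"),
    ("purl", "identifier"),
]


def to_oai_dc(record):
    """Serialize a dict of source fields as an oai_dc:dc XML string."""
    root = ET.Element(f"{{{OAI_DC}}}dc")
    for source_field, dc_element in FIELD_MAP:
        value = record.get(source_field)
        if value is None:
            continue  # skip fields the record does not carry
        values = value if isinstance(value, list) else [value]
        for item in values:
            ET.SubElement(root, f"{{{DC}}}{dc_element}").text = item
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(to_oai_dc({"title": "An Example Thesis", "authors": ["Doe, Jane"]}))
```

Keeping the mapping in one table means that switching to a different target schema would mostly involve replacing `FIELD_MAP` and the namespaces.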

lwrubel commented 9 months ago

Noting here that our current Dublin Core mapping is suboptimal. When proceeding, we should consider whether it is worth putting effort into improving that or if it would be more effective to use one of the other supported/recommended metadata schemas.

arcadiafalcone commented 8 months ago

@amyehodge @lwrubel Do we already have a sense of which of the three CORE metadata schemas we want to use?

amyehodge commented 8 months ago

@arcadiafalcone This topic came up in a meeting with Tom, Vivian, and Rochelle last week and was noted as a point that needs discussion. No, we do not yet have a sense of which of the three options we would want to use.

arcadiafalcone commented 8 months ago

@amyehodge What information do we need to make the decision? I can do analysis from the ease/completeness-of-mapping standpoint, but am less familiar with what other concerns might be.

amyehodge commented 8 months ago

@arcadiafalcone I think that information would be really helpful if you'd like to start on that. There is a meeting tomorrow where I might be able to at least start trying to suss this out. But ease/completeness-of-mapping will definitely factor in.