artefactual-sdps / preprocessing-sfa

preprocessing-sfa is an Enduro preprocessing workflow for SFA SIPs
1 stars 0 forks source link

Feature: Add support for Born digital SIPs and update SIP identification rules #34

Closed sallain closed 4 weeks ago

sallain commented 4 months ago

Is your feature request related to a problem? Please describe.

SFA has decided to refine the identification of SIP types, in a way that better reflects the business logic that will be applied to the SIP as it is processed. Instead of Vecteur SIP, GEVER SIP, SIARD SIP, etc., there should be three branches for identification:

Describe the solution you'd like

This should be pretty straightforward. At this time, we are not being asked to add any validation logic for Digital born SIPs beyond the XSD checks already in place - we will just presume that any transfer that makes it to DPS conforms to eCH-0160, so we can treat this as the basic type.

Christa has suggested the following differentiators for Digitized SIPs:

For Digitized AIPs, we can use the same identification check as is currently in place - looking for the additional directory.

Just as a note Digitized AIPs will also have the same PREMIS metadata as Digitized SIPs; however, they do not always have the word "Vecteur" in the AIP name.

Describe alternatives you've considered

A clear and concise description of any alternative solutions or features you've considered.

Additional context

sallain commented 3 months ago

@djjuhasz Samples for all SIP types are available here

fiver-watson commented 3 months ago

Reopening this issue for a low-priority fix - It seems that right now "Vecteur" being in the SIP name is case-sensitive, so a sample like our previous default little_vecteur_sip was failing, until I remade it as little_Vecteur_sip.

It would be nice to make this name check case-insensitive!

jraddaoui commented 3 months ago

This should be fixed now @fiver-watson.