keeps / dbptk-developer

DBPTK Developer - library and command-line tool for execution of database preservation actions
http://www.database-preservation.com
GNU Lesser General Public License v3.0
43 stars 19 forks source link

Update SIARDDK schemas or support multiple schema versions #601

Open daniel-skovenborg opened 7 months ago

daniel-skovenborg commented 7 months ago

The version of SIARDDK used in DBPTK is based on the Executive Order no. 1007 of 2010, but the current version of SIARDDK required for archives for the Danish National Archive is based on Executive Order no. 128 of 2020. Cf. this page. There are no model changes in the relevant files, tableIndex.xsd and fileIndex.xml (the first is unchanged, the latter has an addition to the patterns for fileNameType). Thus, for DBPTK this is just a matter of bundling the correct schemas needed for the archive test utility, ADA, to accept the archive.

Unfortunately, the old version is still relevant for systems that began delivering archives to the DNA before 2020. Old users will have to copy the old schemas to the archive – just as new users are currently required to do – unless multiple versions are supported.

The new schemas can be found here. The link is from this page (in Danish) which also contains the link to the old version.

Please notice that there's an error in archiveIndex.xsd which should be fixed if bundling the new version: The xs:schema version attribute contains a tab character which should be removed (it breaks ADA).