uga-libraries / general-aip

This is the general workflow to make archival information packages (AIPs) that are ready for ingest into the UGA Libraries' digital preservation system (ARCHive). The workflow organizes files, extracts and formats metadata, and packages the files. It may be used for any combination of file formats.
Creative Commons Attribution Share Alike 4.0 International
4 stars 0 forks source link

Simplify preservation.xml creation #26

Open amhanson9 opened 9 months ago

amhanson9 commented 9 months ago

Is it possible to use Python instead of XSLT for making and validating the preservation.xml? Python is much more familiar to current developers than XSLT, and it would eliminate two dependencies (saxon and xmllint, which is the only reason we still install Strawberry Perl). It could also cut out intermediate steps for making combined and cleaned FITS XML before getting to preservation.xml.

Would it be possible to use json instead of XML?

amhanson9 commented 9 months ago

Python and XML: in addition to ElementTree, this SAA blog post references using BeautifulSoup and xmlschema to work with EAD.