EHRI / data-validations

Repository for various data validation schemata.
2 stars 2 forks source link

EAD structural analysis #2

Open mikesname opened 8 years ago

mikesname commented 8 years ago

We would like to be able to compare two EAD files structurally with the following (approximate) logic:

VladimirAlexiev commented 8 years ago

cc @boyan-simeonov and @lindareijnhoudt

Hi Mike! Guess this is related to Synchronization. Why not delete all objects ingested from the EAD, and create them anew?

We need to figure out how to do this, since it's an explicit requirement for USHMM reingest, see https://docs.google.com/document/d/1tsprKbISLteIO6sbKMNcnIz14CrKjAoP1mSkXW56dts/edit#heading=h.bpgb083jmll1: "preserve links between parallel descriptions manually added in the portal, and those ingested from USHMM"

About your original question, I'd study features of https://www.oxygenxml.com/xml_diff_and_merge.html and http://www.altova.com/diffdog/xml-diff.html, then look for an open source library that implements part of these features.