kba / transkribus-to-prima

Convert Transkribus PAGE-XML to standard PAGE-XML
11 stars 2 forks source link

remove Tag, Property and Link, or transform adequately #3

Closed bertsky closed 2 years ago

bertsky commented 2 years ago

All elements of the structural hierarchy contain an (arbitrary long) sequence of Tag, Property and Link elements in Transkribus. These are invalid under the namespace and original schema.

On first glance, I believe we could transform these into: