clarin-eric / parla-clarin

Schema for modelling parliamentary debates
https://clarin-eric.github.io/parla-clarin/
21 stars 6 forks source link

Better explanation of validation #10

Open TomazErjavec opened 4 years ago

TomazErjavec commented 4 years ago

In the Wiki, it is baldly stated that one should use xmllint (also for XInclude expansion). However, xmllint (at least old versions?) has a bug that when the document becomes too big, it starts spewing out errors "validity error : ID ... already defined" for all defined xml:ids, even though they are correctly defined only once.

This needs to be investigated and alternative solutions to validation of large corpora found.

TomazErjavec commented 4 years ago

This docu should also give a pointer to https://github.com/projectEndings/diagnostics and maybe to my own (enhanced) check-links.xsl (which should become part of this git project).

matyaskopp commented 2 years ago

related issue: https://github.com/clarin-eric/ParlaMint/issues/137