TEIC / Stylesheets

TEI XSL Stylesheets
238 stars 126 forks source link

Test RDF output is apparently invalid #214

Open martindholmes opened 7 years ago

martindholmes commented 7 years ago

Test/test12.xml is apparently designed to check TEI conversion to CIDOC CRM in RDF. The expected-results/test12.rdf file, though, has two invalidities according to Oxygen:

On Line 48, the element <rdf:value> has a text node as a direct child; although this appears to be permitted elsewhere in the file, in this context as a child of <E35_Title> it appears to be invalid.

On line 51, the element <F30_Publication_Event> seems to be invalid.

I don't know RDF and CIDOC CRM well enough to start debugging this, but Oxygen is using a file called "rdfxml.rnc" to do the validation; if this turns out to be a trustworthy schema, we could add it to the repo and validate RDF output against it as part of the test.

hcayless commented 7 years ago

Color me deeply skeptical. Validating RDF/XML with a schema is for suckers.

martindholmes commented 7 years ago

If there's no way of knowing whether the output is right or not, what's the point in the test then?

hcayless commented 7 years ago

That is an excellent question.

GVogeler commented 3 years ago

RDF schema validation in Oxygen relies on frameworks\rdf\schema\rdfxml.rnc which describes itself as:

# This schema is for information only and NON-NORMATIVE
#
# It is based on one originally written by James Clark in
# http://lists.w3.org/Archives/Public/www-rdf-comments/2001JulSep/0248.html
# and updated with later changes.

Using an RDF parser is probably the better way to check validity. See https://www.w3.org/RDF/Validator/documentation and https://www.w3.org/People/Barstow/#online_parsers, or http://librdf.org/raptor/rapper.html for instance.