pyvandenbussche / lov

Linked Open Vocabularies (LOV) - FrontEnd
http://lov.okfn.org/dataset/lov/

Broken triples #69

Closed. sharpaper closed this issue 6 years ago

sharpaper commented 6 years ago

$ riot --validate lov.nq

11:22:50 WARN  riot                 :: [line: 73102, col: 74] Lexical form '
    <div>
      The master version of this vocabulary is maintained in <a
      href="https://github.com/listinc/juso-ontology">the GitHub repository</a>
      so others can contribute edits and improvements.
    </div>
    ' not valid for datatype XSD XMLLiteral
11:22:50 WARN  riot                 :: [line: 96339, col: 85] Lexical form '2013-04-25T118:09:00Z' not valid for datatype XSD dateTime
11:22:50 WARN  riot                 :: [line: 96412, col: 1 ] Bad IRI: <http://http://ifgi.uni-muenster.de/~j_tram02/foaf.rdf> Code: 12/PORT_SHOULD_NOT_BE_EMPTY in PORT: The colon introducing an empty port component should be omitted entirely, or a port number should be specified.
11:22:50 WARN  riot                 :: [line: 96413, col: 1 ] Bad IRI: <http://http://ifgi.uni-muenster.de/~j_tram02/foaf.rdf> Code: 12/PORT_SHOULD_NOT_BE_EMPTY in PORT: The colon introducing an empty port component should be omitted entirely, or a port number should be specified.
11:22:50 WARN  riot                 :: [line: 96414, col: 1 ] Bad IRI: <http://http://ifgi.uni-muenster.de/~j_tram02/foaf.rdf> Code: 12/PORT_SHOULD_NOT_BE_EMPTY in PORT: The colon introducing an empty port component should be omitted entirely, or a port number should be specified.
11:22:50 WARN  riot                 :: [line: 96415, col: 1 ] Bad IRI: <http://http://ifgi.uni-muenster.de/~j_tram02/foaf.rdf> Code: 12/PORT_SHOULD_NOT_BE_EMPTY in PORT: The colon introducing an empty port component should be omitted entirely, or a port number should be specified.
11:22:50 WARN  riot                 :: [line: 96416, col: 1 ] Bad IRI: <http://http://ifgi.uni-muenster.de/~j_tram02/foaf.rdf> Code: 12/PORT_SHOULD_NOT_BE_EMPTY in PORT: The colon introducing an empty port component should be omitted entirely, or a port number should be specified.
11:22:50 WARN  riot                 :: [line: 96418, col: 85] Lexical form '2013-04-25T118:09:00Z' not valid for datatype XSD dateTime
11:22:50 ERROR riot                 :: [line: 203552, col: 135] Bad character in IRI (space): <http://securitytoolbox.appspot.com/MASO#Virus[space]...>

sharpaper commented 6 years ago

Besides the whitespace in IRIs, there are also some broken subjects: these triples have <> as the subject instead of an absolute URI.
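As a rough illustration of how the two problems described above (whitespace inside an IRI, and `<>` as a subject) could be located in the dump before running a full validator, here is a minimal Python sketch. The function names `find_iri_problems` and `scan_file` are hypothetical, the example.org IRIs are placeholders, and the regular expressions assume one statement per line with literals escaped as in N-Quads. It is not the fix proposed in this thread and is no substitute for `riot --validate`.

```python
import re

# Quoted literals are blanked out first so that markup inside a literal
# (for example an HTML snippet) is not mistaken for an IRI.
LITERAL = re.compile(r'"(?:[^"\\]|\\.)*"')
# Any <...> term remaining after literals are removed is treated as an IRI.
IRI = re.compile(r"<([^<>]*)>")


def find_iri_problems(line):
    """Return a list of problem descriptions for one N-Quads line."""
    text = LITERAL.sub('""', line.strip())
    if not text or text.startswith("#"):
        return []  # blank line or comment
    problems = []
    if text.startswith("<>"):
        problems.append("empty relative IRI as subject")
    for match in IRI.finditer(text):
        if re.search(r"\s", match.group(1)):
            problems.append("whitespace in IRI: <%s>" % match.group(1))
    return problems


def scan_file(path):
    """Yield (line_number, problem) pairs for every suspicious line in a file."""
    with open(path, encoding="utf-8") as handle:
        for number, line in enumerate(handle, start=1):
            for problem in find_iri_problems(line):
                yield number, problem
```

Calling `scan_file("lov.nq")` and printing each pair would give line numbers that can be compared against the riot report above. The check is deliberately narrow: it does not validate datatypes or IRI syntax in general.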

sharpaper commented 6 years ago

A possible fix would be:

gatemezing commented 6 years ago

Hi @sharpaper, thanks for spotting this issue. Actually, those are "things" inside vocabs' publishers. I will see how to fix the dump. Is this related to https://github.com/pyvandenbussche/lov/issues/66?

pyvandenbussche commented 6 years ago

lov.nq and all the latest vocabulary files can now be processed without syntax errors.