dayures / parrot

Parrot is A RIF and RDF Ontologies documentation Tool. It provides users (both business users and developers) with useful reference documentation about rulesets and ontologies expressed in standard languages, such as OWL and RIF.
Eclipse Public License 1.0

not parsing a URI with BOM (file with BOM) #3

Closed dayures closed 11 years ago

dayures commented 12 years ago

Originally reported by: Carlos Tejo (Bitbucket: carlos_tejo, GitHub: Unknown)


When documenting the URI [1], a Jena exception is raised ("Content is not allowed in prolog.") because the RDF/XML document cannot be parsed. The reason is that the file [1] starts with a Byte Order Mark (BOM) [2], and InputStreamReader does not remove it, so the BOM reaches the parser. This behavior has been reported as a bug [3], but it has not been fixed.
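The decoding behavior can be reproduced without Jena or the remote file. The following is a minimal sketch, assuming a UTF-8 encoded input; the class name `BomProblemDemo` and the sample bytes are made up for illustration and are not part of Parrot.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of Parrot.
final class BomProblemDemo {

    /** Decodes the bytes as UTF-8 and returns the first decoded char, or -1 if the input is empty. */
    static int firstChar(byte[] bytes) throws IOException {
        try (Reader reader = new InputStreamReader(
                new ByteArrayInputStream(bytes), StandardCharsets.UTF_8)) {
            return reader.read();
        }
    }

    /** Returns the given ASCII text as UTF-8 bytes, prefixed with the UTF-8 BOM (EF BB BF). */
    static byte[] withUtf8Bom(String text) {
        byte[] body = text.getBytes(StandardCharsets.UTF_8);
        byte[] out = new byte[body.length + 3];
        out[0] = (byte) 0xEF;
        out[1] = (byte) 0xBB;
        out[2] = (byte) 0xBF;
        System.arraycopy(body, 0, out, 3, body.length);
        return out;
    }

    public static void main(String[] args) throws IOException {
        // The reader hands U+FEFF to the caller instead of '<'.
        System.out.printf("first char: U+%04X%n", firstChar(withUtf8Bom("<?xml version=\"1.0\"?>")));
    }
}
```

Because the first character the XML parser sees is U+FEFF rather than `<`, it rejects the document before reaching the XML declaration.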

One proposal is to use a custom reader class in URLInput.openReader() as a workaround [4].
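A workaround of this kind could look roughly like the sketch below. It is neither the class from [4] nor the code of the eventual fix: it handles only the UTF-8 BOM and assumes the content is UTF-8, and the names `BomSkippingReaders` and `openUtf8Reader` are hypothetical.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.PushbackInputStream;
import java.io.Reader;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of Parrot or Jena.
final class BomSkippingReaders {

    /** Wraps the stream in a UTF-8 reader, discarding a leading UTF-8 BOM (EF BB BF) if present. */
    static Reader openUtf8Reader(InputStream in) throws IOException {
        PushbackInputStream pushback = new PushbackInputStream(in, 3);
        byte[] head = new byte[3];
        int n = readUpTo(pushback, head);

        boolean hasBom = n == 3
                && (head[0] & 0xFF) == 0xEF
                && (head[1] & 0xFF) == 0xBB
                && (head[2] & 0xFF) == 0xBF;

        // No BOM: put the peeked bytes back so the reader sees the full content.
        if (!hasBom && n > 0) {
            pushback.unread(head, 0, n);
        }
        return new InputStreamReader(pushback, StandardCharsets.UTF_8);
    }

    /** Reads until the buffer is full or the stream ends; returns the number of bytes read. */
    private static int readUpTo(InputStream in, byte[] buf) throws IOException {
        int total = 0;
        while (total < buf.length) {
            int r = in.read(buf, total, buf.length - total);
            if (r < 0) {
                break;
            }
            total += r;
        }
        return total;
    }
}
```

A method such as URLInput.openReader() could then return `BomSkippingReaders.openUtf8Reader(stream)` instead of constructing an InputStreamReader directly. Peeking with a PushbackInputStream keeps files without a BOM byte-for-byte unchanged.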

[1] http://ontologies.semanticarts.com/gist/gist6.1.owl
[2] http://en.wikipedia.org/wiki/Byte_order_mark
[3] http://bugs.sun.com/view_bug.do?bug_id=4508058 (UTF-8 encoding does not recognize initial BOM)
[4] http://koti.mbnet.fi/akini/java/unicodereader/


dayures commented 11 years ago

Original comment by Carlos Tejo (Bitbucket: carlos_tejo, GitHub: Unknown):


Fixed in commit 225571b:

https://bitbucket.org/fundacionctic/parrot/commits/225571bf13223776b1de03afc3dd51ed20ed09d2