digicademy / xtriples

A generic webservice to extract RDF statements from XML resources
http://xtriples.lod.academy
MIT License
18 stars 3 forks source link

Clean/sort/group output #1

Open awagner-mainz opened 8 years ago

awagner-mainz commented 8 years ago

Currently, statements are not grouped by subject - or rather, they are grouped sometimes and not so at some other times. It would be great to have this sorted somehow. One possible starting point for researching this might be Jena's riot command. I have been using it locally for some time now and it compacts the rdf/xml considerably, although I am not sure about how it decides how to group and sort the statements.

The command I am using: riot.sh --strict --check --time --stop --output=rdf/xml-abbrev input.rdf > output.rdf

input.rdf.txt output.rdf.txt

PS. Perhaps this is superfluous if it should become possible to group statements at another point in the extraction process, which may be desirable from a POV of performance anyway.

metacontext commented 4 years ago

As this is at concept stage I would suggest to add it to the 2.x milestone. RDFification ist completely handled by any23 and in my opinion we should avoid (or really have a good reason) to involve other external calls/tools since this will decrease performance and create additional dependencies on the main service. Speaking strictly from the data side of things we would also need to consider where this feature would bring added value beyond "optics". I would favour letting third party systems that receive the data from XTriples (like triple stores of any kind of app that calls the service) handle structuring/ordering the extracted RDF.