RMLio / rmlmapper-java

The RMLMapper executes RML rules to generate high quality Linked Data from multiple originally (semi-)structured data sources
http://rml.io
MIT License
146 stars 61 forks source link

DOC: why is sources files not a param on the cli? #166

Open pvgenuchten opened 2 years ago

pvgenuchten commented 2 years ago

This is typically a newbee question, but improving docs will help next generation of newbee's. I was struggling with the fact that i couldn't understand the syntax of the cli, that it didn't have a sources(path) parameter Until i realised that the mappingfile itself has references to the sources files. My suggestion is to dedicate a paragraph to this on the cli-help

On the other hand, would-it be interesting to have an optional parameter -sources, which would override the sources-parameters in the mapping file. a scenario is that you have multiple sets of files (sharing a model) and you want to run the mapper on each of them with a script. challenging off course is how to map a file to a mapping

bjdmeest commented 2 years ago

Thanks for the issue! The general idea is that all mapping configuration is part of the RML mapping, to make it self-contained. However, as RMLMapper-java supports multiple mapping inputs, you could configure a first mapping without reference to the actual source, and add a second mapping that just links your first mapping to a specific source, I gave an example at https://github.com/RMLio/rmlmapper-java/issues/161#issuecomment-1123723597. Please let us know if that helps! I will keep this issue open to remember us to improve the documentation ;).