Closed andimou closed 5 months ago
But here is talking about handling duplicate rows in R2RML when subjectMap is a blank node, what does it need to be adjusted for heterogeneous data? (apart from generalizing the "row" concept)
given we don't have a primary key, what would the default behaviour be?
Would we expect to always check for duplicates? or would we generate a different blank node for each iteration and implementations may reuse the same blank node if they think there's a duplicate? if so, how should an engine indicate that provide such an option?
btw, I think this is an rml-core issue rather than an rml-fno issue
This is about default mapping generation.
IMO the first question to answer here is: are we going to specify default mapping generation in RML?
so the conclusion about this issue after the call was that we allow Blank Nodes without an expression map and we do not clarify what happens if the subject map is referenced by a referencing object map.
If no objections come in this issue or raised during the call, this will be considered as closed.
I think we still need to decide whether or not default mappings are a thing we want to maintain as part of RML.
Are there any current implementations of default mapping generators?
@pmaria if you refer to this R2RML section (https://www.w3.org/TR/r2rml/#default-mappings), I'll remove it for RML. No current implementation of R2RMl or RML has default mapping generators. There were engines like MIRROR https://github.com/oeg-upm/MIRROR, but afaik they were deprecated
Yes, that's what I'm referring too.
I also agree that we can remove this from the spec. If we all agree on this, this issue can be closed, since the issue is about default mapping generators.
@bjdmeest @andimou @DylanVanAssche @chrdebru @frmichel do you agree with this proposal? i.e. no section of default mapping generator in RML-core?
I agree. There are other tools to access non-RDF as triples.
We still need to update the spec. Let's keep this issue open until that is done.
PR with the removal #133
The R2RML spec includes the following paragraph for duplicates handling:
This needs to be adjusted in the case of heterogeneous data