uwlib-cams / rml

Using the RML Mapper (https://github.com/RMLio/rmlmapper-java) to convert RDA data to BIBFRAME.
Creative Commons Zero v1.0 Universal
5 stars 2 forks source link

BIBFRAME in Sinopia issue: Duplicate data "in Sinopia." #34

Closed gerontakos closed 3 years ago

gerontakos commented 3 years ago

It appears description sets were loaded May 4 then re-loaded May 11. The May 4 do not display in Sinopia RTs but their IRIs still dereference. The new IRIs do display in Sinopia RTs and do dereference (as JSON-LD) so this appears to be OK; the new IRIs do not display in Sinopia RTs. So nearly-duplicate data is in fact live and dereferencing, but nobody will probably ever know about it. This issue has been created to open a discussion. Probably requires no action.

mcm104 commented 3 years ago

Hm. I'll have to look into this, because I was only loading things into Sinopia Stage until 5/13... Are these stage IRIs, or did the data make it into production somehow?

gerontakos commented 3 years ago

Production. bf:AdminMetadata>bf:creationDate=5/11, bf:AdminMetadata>bf:changeDate=5/13.

mcm104 commented 3 years ago

I'm realizing now some dates are going to be a little different because I was running the transform one step at a time...

Hopefully that explanation makes sense. The creation-5/11-change-5/13 makes sense to me, but anything in production from 5/4 does not... If you see this again, could you send me the IRI?

Edit: I'll also add that the reason for the "official" IRI system was specifically to avoid uploading the same data into production multiple times under different IRIs

gerontakos commented 3 years ago

That's an interesting solution: we assume the RDA identifiers are stable; they do not change. Create a table row each time a BF resource is derived from an RDA resource. When there is new data, or the transform is run again, the lookup of the stable RDA ID is performed to see if the old BF ID should be retained (and a new change date added) or, if the RDA ID is not found, a new BF ID should be created. That should work! I'll close this issue now and if we see a problem, we'll hopefully remember this as a potentially serious issue and possibly re-open.