Closed kevinschaper closed 1 year ago
Yeah that would be awesome. I think this is key to increase trust in our data products! Thanks for looking at this!
GO Ingest: being updated to not need a map STRING ingest: definitely needs this If OMIM is using mim2gene (it looks like it is?), then it should keep the original subject
@matentzn, do you think it makes sense to store a concatenation of the ZP ID set as an original object?
Hmmmm. Interesting question! Can you bring it up at a data call? I would spontaneously say yes, but.. Not sure would like to hear @cmungall opinion on storing provenance on pre-composed raw data that was internally linked in a post-composed way..
I'm going to resolve this issue, since we're handling this where it's straightforward. The connection between ZP terms and the original post-composition stands on it's own just fine, I hope?
I have recently updated it - it's tied to ZP releases which means we probably need more frequent ZP releases, and also you have to handle cases where there is no link yet between ZFIN and ZP (there will always be some). I would vote for simply dropping.
We should look through the ingests where we currently use maps (OMIM, String, GO Annotations, ZP?) and see if it makes sense to preserve any parts of the original triple using:
https://w3id.org/biolink/vocab/original_subject https://w3id.org/biolink/vocab/original_predicate https://w3id.org/biolink/vocab/original_object