glass-ships closed 2 years ago
Here are the extras that I think it still needs:
Curious about something I noticed looking through the output of this ingest: for genes with multiple publications, the way it's written, the nodes file will contain duplicate gene nodes, as many as there are publications that mention that gene.
Is that the normal structure for a nodes file? Or should the writer ignore duplicate nodes?
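If the answer is that the writer should skip duplicates, one option is to track the node ids already written and drop repeats. This is a minimal sketch, not the ingest's actual writer: `write_unique_nodes`, the `id` field name, and the sample rows (placeholder ids, not real MGI or PubMed records) are all assumptions for illustration.

```python
import csv
import io


def write_unique_nodes(rows, out, id_field="id"):
    """Write node rows as TSV, skipping rows whose id was already written.

    Returns the number of rows actually written.
    """
    seen = set()
    writer = None
    written = 0
    for row in rows:
        node_id = row[id_field]
        if node_id in seen:
            # Same gene seen again via another publication: skip it.
            continue
        seen.add(node_id)
        if writer is None:
            # Build the header from the first row's keys.
            writer = csv.DictWriter(
                out, fieldnames=list(row), delimiter="\t", lineterminator="\n"
            )
            writer.writeheader()
        writer.writerow(row)
        written += 1
    return written


# Placeholder rows: one gene mentioned by two publications.
rows = [
    {"id": "MGI:0000001", "category": "biolink:Gene"},
    {"id": "PMID:0000001", "category": "biolink:Publication"},
    {"id": "MGI:0000001", "category": "biolink:Gene"},
    {"id": "PMID:0000002", "category": "biolink:Publication"},
]

buffer = io.StringIO()
count = write_unique_nodes(rows, buffer)
```

The trade-off is that the set of seen ids lives in memory for the whole run, which should be fine for a gene-sized node list but is worth keeping in mind for larger ingests.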
Create ingest of MGI Gene to Publication data.
Note: this is still on version 0.1.2 of Biolink Model Pydantic.
When that's updated to the latest version, we'll want to update all tests/scripts, changing:
NamedThingToInformationContentEntityAssociation to InformationContentEntityToNamedThingAssociation
predicate.mentions to predicate.mentioned_by