Closed callahantiff closed 3 years ago
Verify node identifiers in output metadata, formatting for ensemble transcripts looks a bit off
v2.0.0
, but I think adding this information will increase the usability of the result KG and make it easier for people to extend the KG with new data sourcesedge type
and node type
) that can be used as a high-level way to organize the output data. This should be VERY simple to create for all builds.A brief example of what this would like for node identifier rs201492213
is shown below:
node_type
= 'variant'
edge_type
= ['variant-phenotype', 'variant-gene']
Task Type: CODEBASE
Improve the output metadata for nodes and edges in the knowledge graph
The following items have been condensed from the issues above.
v2.0.0
, but I think adding this information will increase the usability of the result KG and make it easier for people to extend the KG with new data sources edge type
and node type
) that can be used as a high-level way to organize the output data. This should be VERY simple to create for all builds Done as part of #84
Problem: Right now the node metadata that is output is keyed by an identifier, which means if you use the integer edge lists, but want node labels you have to use the provided dictionary that maps node integers to identifiers first.
Solution: In the next iteration, I will add a new column that includes the identifier and the integer. Examples of each output are shown below.