monarch-initiative / monarch-ingest

Data ingest application for Monarch Initiative knowledge graph using Koza
https://monarchinitiative.org
14 stars 1 forks source link

kgx transform of monarch ontology is putting subject, object, predicate columns in nodes file #224

Closed kevinschaper closed 2 years ago

kevinschaper commented 2 years ago

In monarch_ingest/pipeline.py we're transforming the ontology json using:

from kgx.cli.cli_utils import transform as kgx_transform
...
    kgx_transform(inputs=["data/monarch/monarch.json"],
                  input_format="obojson",
                  stream=False,
                  output=f"{OUTPUT_DIR}/monarch_ontology",
                  output_format="tsv")

and strangely, we're getting subject, object and predicate columns in our nodes file:

%  head -1 output/monarch_ontology_nodes.tsv
id      category        name    description     xref    provided_by     synonym iri     knowledge_source        object  predicate       relation        same_as subject subsets

I guess we might need to specify output properties?

kevinschaper commented 2 years ago

I think this is probably resolved, but I'm moving it to TODO so that we can double check it