Hello - we've started using your KGX bulk download for the NodeNormalizer, and it's been great - thanks for making that!
However, we still have to also query your API because there doesn't seem to be any indication of the "preferred" identifier for each group of equivalent nodes in the bulk download.
For example, the "preferred" identifier for the concept 'water' seems to be PUBCHEM.COMPOUND:962, which is reported under [input_curie] --> "id" --> "identifier" in the NodeNormalizer RestAPI /get_normalized_nodes response:
@cbizon Would it be okay if I added a preferred_id node property to store the preferred id for any clique? Or is there an existing KGX property that would be better suited for this?
Hello - we've started using your KGX bulk download for the NodeNormalizer, and it's been great - thanks for making that!
However, we still have to also query your API because there doesn't seem to be any indication of the "preferred" identifier for each group of equivalent nodes in the bulk download.
For example, the "preferred" identifier for the concept 'water' seems to be
PUBCHEM.COMPOUND:962
, which is reported under[input_curie] --> "id" --> "identifier"
in the NodeNormalizer RestAPI/get_normalized_nodes
response:While in the bulk JSON lines KGX nodes file, I see
equivalent_identifiers
, but no indication of the "preferred" identifier for each cluster:It would really help us out if you could add this. Thanks!