Open saramsey opened 2 years ago
Tagging @finnagin and @dkoslicki for their take. And since the logfile entry referenced a weird graph geometry, tagging @amykglen as well.
This is a problem in the query_graph_info module. But I don't know what the cause is. It looks like a simple query-graph
If query_graph_info is invoked after all the overlay stuff, then it may be getting confused by virtual edge C1. We could add biolink:has_real_world_evidence_of_association_with as an edge to be ignored at line 130. BUT, I'm vaguely thinking that biolink:has_real_world_evidence_of_association_with may sometimes be a real edge, too? We may want to devise a more reliable way to annotate virtual edges so that ARAX can ignore them better. Both in query_graph_info and query_graph_interpreter.
FWIW, KG2.7.4 has 1,027 edges with the biolink:has_real_world_evidence_of_association_with
predicate:
ah, if the same predicate can be used for a virtual edge and a real edge, it will cause some problems here, and it is all the more important to devise a more reliable way to annotate virtual edges
So does query_graph_info always get called after DSL is run? I thought that was something that was called first before the DSL?
I was kinda wondering the same thing. I think I originally wrote it to inspect the incoming query_graph. But then somehow I later inferred that it was being run by resultify. Perhaps to decide what the essence of a result it. In short, I don't know, but it seems so.
FWIW, since CHP/ICEES/COHD all want to use the biolink:has_real_world_evidence_of_association_with
predicate, and this is used by our overlay_clinical_info
, such instances of predicates for real and virtual edges will arise. So question is:
biolink:has_real_world_evidence_of_association_with
edges supposed to be in KG2/Cquery_graph_info
code to ignore virtual edges. I would vote the later b). Since virtual edges are "second class citizens" Looks like this attribute that Chunyu added could be used to identify virtual edges: https://github.com/RTXteam/RTX/issues/1566#issuecomment-903036486
From meeting: plan is to work on this at the next mini hackathon on 1/19
So, when running this DSL query on
arax-backup.rtx.ai
(theproduction
dev-area) last night:[I adapted this query from the test
test_kitchen_sink_api
in the test moduleRTX/code/ARAX/test_production_api/test_ARAX_api.py
.]So, when I ran it, I saw an interesting entry in the logfile from the
query_controller
child process (attached), which seemed to occur in the ranker (?):Is this expected? It seemed to recover from the error and continue on its way. Though there were no results in the end, in this case.
From the interpreted DSL, the query graph was:
arax-query-controller-child-process-gc92r_dt.log.gz