Open gglusman opened 12 months ago
Thank you @gglusman for pointing this out. Those are excellent points in my opinion. Here is my analysis of your ticket, please tell me if I understood properly (I would split this ticket accordingly) and feel free to correct any misunderstanding :) :
is input of
is presentis input
of and has input
treats
if supported by only 1 publication@sandrine-m I'm thinking about this a bit differently. The main points:
Text mining: a 1979 paper mentions in passing that someone was initially researching using X for the treatment of Y, and this is interpreted to mean that X treats Y. This is not necessarily a semmed filtering issue. Of note, this is significantly weaker evidence of X actually treating Y than if there is a clinical trial testing the effectiveness of X in the treatment of Y... and we were told we cannot derive 'X treats Y' from the presence of such a clinical trial.
Reasoning: it's not just the 'input of' that is suspect in reasoning. There are several predicates that are hardly transitive in nature but are being interpreted as such. Some intuitive examples:
Yet somehow these two extremely weak things (a text misparsing and a weak inference path) become a top-scoring result. Agreed on the missing EPC bit and the not-clear-how-scoring-works. :)
Thank you! I'll split the ticket accordingly to make sure we treat all points.
I am keeping this original ticket for the reasoning part as I find the discussion in this thread important to keep for this topic. Changing title accordingly.
Another example from the ASM case:
A number of genes increase secretion of dinoprostone and their corresponding things that are claimed to treat ASM. Guilt by association can only go so far...
In #484 Sarah exposed another example of this, with with two drugs being 'part of Animals'... and somehow this being sufficient support to transfer the 'treats' assertion from one to the other.
Thanks @gglusman for associating both issues, that is very helpful for triage. I have created a "transitive" sublabel for all these issues to help go back to them post relay.
Noting #465 as affected by weak transitive logic.
@gglusman do you happen to know what ARA prediction tools reported these problematic support paths? I suspect that is not a manually crafted or automated rule from an ARA like BTE, Unsecret, or ARAGORN, but more likely an arbitrary path through a larger explanation graph like those created by the ARAX reasoner - which are really not meant to be considered independently, outside of the context of this larger graph,
If it is indeed the case that these types of paths are from ARAX - then we will have perhaps isolated the source of the problem, and be better able to find a solution. This would also be interesting to know in upcoming EPC planning discussions about how to weight the strength of evidence provided by support paths with different topologies.
@mbrush We collected a number of issues with the transitive label, and more than one ARA seems to be implicated.
Testing on 2024/01/19, what drugs may treat Alzheimer Disease?
One answer is "Iron, dietary". The explaining path is:
This would be a correct redrawing of the graph:
That is, SLC11A2 causes decreased uptake of both dietary iron and zinc; zinc is claimed to treat AD. There is no support for iron->AD.
from TAQA:
Upon re-re-testing what drugs may treat Familial Pityriasis Rubra Pilaris (on ci), and comparing to the previous test, I see a new result with score 100:
Multiple issues with this result:
This clearly does not provide confident support for the 'retinol treats PRP' assertion on which the 9-cis-Retinal one depends. Yet it gets score 100. The direct 'treats' path cites Aragorn as source. The 'retinol treats PRP' triple cites BTE and TMKP.