biothings / biothings_explorer

TRAPI service for BioThings Explorer
https://explorer.biothings.io
Apache License 2.0
10 stars 11 forks source link

PFOCR result-augmentation: adjust logic #876

Open AlexanderPico opened 3 weeks ago

AlexanderPico commented 3 weeks ago

Per discussion notes and Slack thread with Gus, we have consensus on the following plans:

1. Algorithm - As a general policy/logic, we will identify PFOCR pathway figures that contain all supported-category nodes in the node-bindings (2 in the case of MVP1/2; 3 in the case of Pathfinder). And, obviously, we skip case where there are fewer than 2 supported-category nodes. After passing this minimal requirement, we then include any additional supported-category nodes from the result's aux_graph in the calculation of an enrichment score. This way all results are relevant to the full context of query and are scored to reflect the inclusion of aux_graph nodes as well.

2. Output location - The PFOCR hits can continue to go into message/results/analyses. No change.

3. Structure - Per this issue, we want to include two URLs per hit, like so...

"pfocr": [
    {"matchedCuries": [
         "NCBIGene:8445",
         "NCBIGene:1859",
         "NCBIGene:2932",
         "NCBIGene:2735"
       ],
      "score": 0.2352941176470588,
      "pmc": "PMC2743241",
      "figureUrl": "https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2743241/bin/nihms-104435-f0001.jpg",
      "pfocrUrl": "https://pfocr.wikipathways.org/figures/PMC5463358__fnagi-09-00176-g003.html"
    },
colleenXu commented 2 weeks ago

@AlexanderPico

My understanding is that only point 1 has work that needs doing. And Points 2 + 3 are already done (@tokebe to confirm?)

Work to do:

colleenXu commented 2 weeks ago

@AlexanderPico

And I found your "point 1 algorithm wording" a little confusing. Here's my understanding in case it differs from yours...pending @tokebe clarifying things for both of us.

tokebe commented 2 weeks ago

RE @colleenXu's comments:

colleenXu commented 2 weeks ago

@tokebe @AlexanderPico

I've adjusted the titles of this issue and https://github.com/biothings/biothings_explorer/issues/847 to make it clearer what work is being tracked in each.

I think this is correct but feel free to change it if it's not correct.