PathwayCommons / factoid

A project to capture biological pathway data from academic papers
https://biofactoid.org
MIT License

Document: Entity info 'summary' doesn't match the grounding #1301

Open jvwong opened 2 weeks ago

jvwong commented 2 weeks ago

Document the case where an entity's 'summary' text does not match the corresponding information provided in the grounding:

jvwong commented 2 weeks ago

This is related to #1178

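The entities mentioned above carry unrelated fields in their 'association'. For example, Androgen receptor is grounded to an NCBI Gene protein record, yet its association also contains chemical fields ('formulae', 'inchi', 'inchiKey', 'mass', 'monoisotopicMass') and a 'summary' that describes a sesquiterpene: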

{
      "id": "c9396485-14ef-4d4c-b778-2adb79c1b503",
      "secret": "read-only",
      "position": {
        "x": 136.69819168076242,
        "y": 153.9117811923705
      },
      "name": "Androgen receptor",
      "description": "",
      "type": "protein",
      "completed": true,
      "association": {
        "charge": 0,
        "combinedOrganismIndex": 2,
        "dbName": "NCBI Gene",
        "dbPrefix": "ncbigene",
        "dbXrefs": [...],
        "defaultOrganismIndex": 2,
        "distance": 0,
        "esScore": 10.467124,
        "formulae": [
          "C15H24"
        ],
        "id": "367",
        "inchi": "InChI=1S/C15H24/c1-10-7-8-15-9-12(10)14(3,4)13(15)6-5-11(15)2/h7,11-13H,5-6,8-9H2,1-4H3/t11-,12+,13+,15+/m1/s1",
        "inchiKey": "IRAQOCYXUMOFCW-OSFYFWSMSA-N",
        "mass": 204.35106,
        "monoisotopicMass": 204.1878,
        "name": "AR",
        ...
        "organismName": "Homo sapiens",
        "overallDistance": 200100,
        "shortSynonyms": [
          ...
        ],
        "summary": "A sesquiterpene that is cedrane which has a double bond between positions 8 and 9.",
        "synonyms": [
          ...
          "nuclear receptor subfamily 3 group C member 4"
        ],
        "type": "protein"
      }
    },
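
A minimal sketch of how such entities might be flagged, assuming the document JSON has been loaded into Python dicts shaped like the example above. The function name `find_stray_chemical_fields`, the set of fields treated as chemical-only, and the "chemical" type value are assumptions made for illustration; they are not taken from the factoid codebase.

```python
# Hypothetical helper, not part of factoid. The field names come from the
# JSON example in this issue; treating them as "chemical-only" is an assumption.
CHEMICAL_ONLY_FIELDS = {"formulae", "inchi", "inchiKey", "mass", "monoisotopicMass"}


def find_stray_chemical_fields(entity):
    """Return the chemical-style association fields found on a non-chemical entity."""
    association = entity.get("association") or {}
    # Assumption: chemical groundings are marked with type "chemical".
    if association.get("type") == "chemical":
        return []
    return sorted(CHEMICAL_ONLY_FIELDS.intersection(association))


# Trimmed copy of the Androgen receptor entity shown above.
entity = {
    "name": "Androgen receptor",
    "type": "protein",
    "association": {
        "dbPrefix": "ncbigene",
        "id": "367",
        "formulae": ["C15H24"],
        "inchiKey": "IRAQOCYXUMOFCW-OSFYFWSMSA-N",
        "mass": 204.35106,
        "monoisotopicMass": 204.1878,
        "type": "protein",
    },
}

print(find_stray_chemical_fields(entity))
# ['formulae', 'inchiKey', 'mass', 'monoisotopicMass']
```

A non-empty result only indicates that the association mixes fields from different kinds of records. It does not say which record the mismatched 'summary' came from.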