MOZI-AI / annotation-scheme

Human Gene annotation service backend
GNU General Public License v3.0
3 stars 4 forks source link

dataset issues: `(MoleculeNode "ChEBI:nan")` #124

Open linas opened 4 years ago

linas commented 4 years ago

The dataset includes this: (MoleculeNode "ChEBI:nan") which is clearly wrong; we expect an integer here, e.g. (MoleculeNode "ChEBI:15846") This atom appears in 70766 different links; many of these are MemberLinks to pathways.

linas commented 4 years ago

There also seem to be the following invalid pathways:

 ((ConceptNode "SMPD2 ")
 (ConceptNode "SMPD1 ")
 (ConceptNode "SMPD4 ")
 (ConceptNode "SMPD3 ")
)

All SMP pathways should be of the form SMPnnn with nnn being numeric digits.