wikipathways / pathway-figure-ocr

Extracting gene sets from published pathway figures
Apache License 2.0
15 stars 2 forks source link

Identify PTMs #9

Open ariutta opened 5 years ago

ariutta commented 5 years ago

We could identify PTMs gene products, e.g.: phosphorylation, acetylation and ubiquitination. In slack, @AlexanderPico described the low-hanging fruit we could go after:

One easy target would be when folks name entities with a “-p” or “-P” suffix or prefix. We currently strip these away in our transformations, so I know there are a few.

The more difficult ones to identify would be the gene products with a state attached like a P in a circle, e.g.: Transient_Dendritic_Spine_Growth_following_High-Frequency_Stimulation https://commons.wikimedia.org/wiki/File:Transient_vs._Sustained_Dendritic_Spine_Growth_following_High-Frequency_Stimulation.jpg