SysBioChalmers / Sco-GEM

The consensus GEM for Streptomyces coelicolor -
https://sysbiochalmers.github.io/Sco-GEM/
Creative Commons Attribution 4.0 International
3 stars 7 forks source link

feat: subsystem annotations #66

Closed sulheim closed 5 years ago

sulheim commented 5 years ago

Description of the issue:

In iKS1317 most of the reactions were annotated with both KEGG pathway and KEGG subsystem (header for KEGG pathways), see https://www.genome.jp/kegg/pathway.html. A new annotation should be performed to cover the reactions added from Sco4 and iAA1259.

Additionally, for the reactions without a KEGG identifier it should be possible to acquire the pathway from biocyc? The pathway names acquired from BioCyc should then be translated into the namespace of KEGG to make the annotations more homogenous.

If BioCyc doesn't provide this, maybe we can use the metanetx identfiers?

Expected feature/value/output:

As many reactions as possible annotated with pathway / subsystem. The annotations are currently given as kegg.susbsytem and kegg.pathway, and I think we should continue with these annotations. If the pathway annotation is translated from a biocyc-name, we can include as biocyc.pathway annotation as well.

To do:

I hereby confirm that:

edkerk commented 5 years ago

BioCyc pathways are not as neatly formatted as KEGG's. They are typically smaller (so more different pathways) and can have different instances (slight variations): https://biocyc.org/SCO/class-tree?object=Pathways. While not impossible, it is not that trivial to map these to KEGG pathways / subsystems. MetaNetX doesn't seem to connect reactions to KEGG either?

sulheim commented 5 years ago

Your right about MetaNetX at least, no subsystem or pathway annotation is available. However, I think it is possible to bin the BioCyc pathways into the KEGG pathways by combining some automatic mapping based on name with manual curation. It will take some time though

sulheim commented 5 years ago

This is finished, see PR #72