JetBrains-Research / washu

Reproducible and scalable technical pipelines for ChIP-Seq and RNA-Seq processing
https://artyomovlab.wustl.edu/aging
MIT License
0 stars 0 forks source link

Annotate Reactome using offline database files #35

Open iromeo opened 6 years ago

iromeo commented 6 years ago

Current impl of annotate_reactome.py requires access reactome.org website. Reactome IDs -> Pathways titles annotator could use cached databases downloaded from rectome.org: /mnt/stripe/bio/genomes/ReactomePathways.txt

iromeo commented 6 years ago

Also think of other available annotations and how we can use them.

iromeo commented 6 years ago

Gene to Reactome mapping: “The core unit of the Reactome data model is the reaction. Entities (nucleic acids, proteins, complexes, vaccines, anti-cancer therapeutics and small molecules) participating in reactions form a network of biological interactions and are grouped into pathways. “

Pathways: ReactomePathways.txt Reactions: Ensembl2ReactomeReactions.txt, UniProt2ReactomeReactions.txt, miRBase2ReactomeReactions.txt Associations with other annotations:

Pathways 2 Gene mapping: ReactomePathways.gmt For some reason some pathways missed in ReactomPathays.txt, e.g. R-HSA-157052

iromeo commented 6 years ago

Open Question: Should we consider Reactions or Pathways

(Reactions are not included in Pathways and visa versa)

iromeo commented 6 years ago

Also I think Gene mapping could be done using Ensembl2*.txt file, not manually

olegs commented 6 years ago

@iromeo Please recheck and close if necessary.