HarveyYan / RNATracker

Prediction of mRNA subcellular localization using deep recurrent neural networks
GNU General Public License v3.0
13 stars 4 forks source link

RNATracker

RNATracker is a deep learning approach to learn mRNA subcellular localization patterns and to infer its outcome. It operates on the cDNA of the longest isoformic protein-coding transcript of a gene with or without its corresponding secondary structure annnotations. The learning targets are fractions/percentage of the transcripts being localized to a fixed set of subcellular compartments of interest.

Our method provides computational-centric insights into the the mRNA trafficking mechanism with identication to the cis-acting zipcodes elements from the transcript sequences.

For what's exactly the RNA trafficking mechanism and its role in the broader gene regulatory network, I find this survey extremely helpful.

RNA localization: Making its way to the center stage

Dataset

Other emerging read-mapping technologies investigating subcellular zipcode proximity might provide additional dataset.

Software dependency

Keras version 2.0.9 is recommeneded. The idea can be easily adapted to other deep leaing frameworks such as Tensorflow and PyTorch.

RNAplfold and forgi libraries from the ViennaRNA package and their python wrapper Eden for acquiring RNA secondary annotations.

TOMTOM for comparing similarity between motifs.

Weblogo and its python wrapper Basset for visualizing learned motifs.

Running the codes

Notes

For secondary structures refer to this customized annotator