twang15 / Long-read-RNA

0 stars 0 forks source link

Biology-2 #5

Closed twang15 closed 2 years ago

twang15 commented 3 years ago

Splicing mechanism Intron

Intron facilitate evolution of new genes Alternative splicing greatly increases proteome complexity Higher eukaryotes utilize exon definition to define splice site

twang15 commented 3 years ago

A, T, C, G, N

We know that the four native bases for DNA are AGTC, however, some of the sequences, retrieved from NCBI, contain letter 'N', which illustrates that these nucleotide bases are not deciphered correctly, leaving an unidentified nucleotide.

twang15 commented 3 years ago

Chimeric reads occur when one sequencing read aligns to two distinct portions of the genome with little or no overlap. Chimeric reads are indicative of structural variation. Chimeric reads are also called split reads.

twang15 commented 3 years ago

Illumina Sequencing by Synthesis

  1. Library Prep: oligo
  2. Clustering: PCR amplification
  3. Sequencing
  4. Data analysis
    • transfer
    • store
    • align
    • analysis
    • etc
twang15 commented 3 years ago

image015 Picture1 2012131203018_introns detection-of-genetic-motifs-3-638

twang15 commented 3 years ago

mRNA post-transcription modification

  1. RNA-edit: call A to G change if the gene is on plus strand; T to C change if the gene is on minus strand

DNA replication

  1. DNA helicases
    • There are DNA and RNA helicases. DNA helicases are essential during DNA replication because they separate double-stranded DNA into single strands allowing each strand to be copied.
  2. Primase: enzyme to make a small piece of RNA called primer. The primer marks the starting point of the construction of the new strand of DNA
  3. DNA polymerase: enzyme to bind to the primer and make the new strand. It can only add DNA base from 5' to 3' direction. (From perspective of template strand, DNA polymerase moves from 3' to 5')
    • the leading strand is made continuously
    • the lagging strand cannot be made continuously. The DNA polymerase can only make this strand in small chunks, called Okazaki fragment.
      • each fragment starts with a RNA primer
      • then DNA polymerase adds small chunk of DNA bases from 5' to 3' direction
      • the next primer is added further down the lagging strand
      • repeat the above two steps until a whole strand is finished.
      • exonuclease (an enzyme) removes all RNA primers from both strand of DNA and another DNA polymerase fills the gap left behind (occupied by RNA primers) with DNA bases
  4. DNA ligase (an enzyme) seals up the fragments of DNA for both new replicated strands to form continous double strand.
twang15 commented 3 years ago

Stranded vs. Non-stranded RNA

  1. single-read vs. paired-read
  2. DNA vs RNA
  3. can a read Map to both forward strand and reverse strand at the same time?
  4. which strand, forward or reverse, is the reference genome?
  5. cDNA: complementary DNA (cDNA) is DNA synthesized from a single-stranded RNA (e.g., messenger RNA (mRNA) or microRNA (miRNA)) template in a reaction catalyzed by the enzyme reverse transcriptase.

Differential RNA processing

  1. nRNA is pre-mRNA

Galaxy: read mapping

  1. https://training.galaxyproject.org/training-material/topics/sequence-analysis/tutorials/mapping/tutorial.html

RNA-seq

  1. reveal the presence and quantity of RNA in a biological sample at a specific development stage or physiological condition
    • interpret the functional elements of the genome
    • understand development and disease
    • reveal molecular constituents of cells and tissues
  2. Third generation sequencing technology (such as PacBio) enable PCR-free (avoid PCR bias) and full-length transcripts sequencing
twang15 commented 3 years ago

Peptide_syn

twang15 commented 3 years ago

Primary transcript, mRNA, nRNA, tRNA, hnRNA, pre-RNA

Three types of Primary transcript: A primary transcript is the single-stranded ribonucleic acid (RNA) product synthesized by transcription of DNA, and processed to yield various mature RNA products such as mRNAs, tRNAs, and rRNAs.