Closed Shiywa closed 10 months ago
First question, about leaders and trailers. Yes, as long as gtf specifies an exon and CDS starts at same coordinate as first exon, there is no leader or trailer for those transcripts. To make fake, just assign in the gtf some coordinates before and after your cds. This is of course not possible if sequences are from single orf sequences (first nucleotide is start site of cds etc). But then there would not make any sense to have leaders anyway.
Second question, so from the count table it finds the transcripts, but the count is 0.
Have your verified that this gets counts ?
countOverlaps(a2[111077:111080], rfp) # rfp is riboseq data loaded (mapped against genome with the viral sequences)
If it does not, then it means no reads map there. Let me know the output of countOverlaps.
no, there no reads mapped.
> countOverlaps(a2[111077:111080], RFP_A_r1)
GFP EGFP NS2A NS4B
0 0 0 0
> countOverlaps(a2[111077:111080], RFP_A_r2)
GFP EGFP NS2A NS4B
0 0 0 0
> countOverlaps(a2[111077:111080], RFP_B_r1)
GFP EGFP NS2A NS4B
0 0 0 0
> countOverlaps(a2[111077:111080], RFP_B_r2)
GFP EGFP NS2A NS4B
0 0 0 0
however, I found reads mapped these four genes in .bam files.
what happened ?
sincerely !
These are old ribo-seq files then, so you must have made new bam files, but not new of the other file formats ?
First Delete your "QC_STATS", "ofst" and "pshifted" folders of the experiment (inside the aligned folder), if you had things here before it will load this instead of using the "new" bam files.
Then do this:
rm(list=ls())
df <- read.experiment("your_exp")
QCreport(df) # Will now make report not using old ofst files or count tables
yes, you are right. we have to delete ofst
files. after delete it, reads number is right
Good, any more questions on that? else I close the issue :)
no
hello Roleren, recently, I was adding some virus genes in human genome, I did it like add fasta sequence in human fasta and make some simple annotation info at the tail of gtf files, just like:
then I noticed that
ORFik
quantified transcripts number in the stepORFikQC(df.rfp)
and need to use thegtf.txdb
file. I aimed to get a new txdb file which include the added information like GFP, NS2A et.Then I tried to make txdb by myself, like:
the txdb file is successfully conducted at ["tx"] ["mrna"] ["cds"] but lost ["leaders"]. maybe because I didn't provide 5'UTR and 3'UTR information for these four genes ?
["mrna] info is like:
Then, I confirmed that there are reads mapped into these genes from my files like
Ribo-Seq_A2_1_Aligned.sortedByCoord.out.bam
et. however, after quantification bycountTable_regions
, all of there genes are 0, like :I'm really confused. Can you give me some advice?
notebly, the added gtf files is not same with real genes but cds region which binding to plasmid, is it related to the result?
sincerely !