tacorna / taco

Multi-sample transcriptome assembly from RNA-Seq
http://tacorna.github.io

Taco refcomp bug? #18

Open mukarram-ak opened 6 years ago

mukarram-ak commented 6 years ago

Hello,

I noticed some abnormalities in the results of taco_refcomp (assembly.refcomp.gtf).

  1. If I understand correctly, the category column should describe what the called transcript is, right? If so, I see several transcripts categorised as "lncrna" despite having a cpat_coding_prob of >0.999.
  2. ref_gene_type should be the gene_type or gene_biotype of the annotated transcript/gene. For example, for one of the transcripts above, the category_relative_detail is intronic_same_strand, inside a protein_coding gene. However, the ref_gene_type for that transcript is given as "lincRNA".

I see this in several transcripts (not just one), and I have also checked the GTF file I used for the comparison. Am I missing something here?

yniknafs commented 6 years ago

I can look into this. Any chance you can send me a file snippet so I can try to recreate the error?
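
A minimal sketch of how such a snippet could be pulled out of assembly.refcomp.gtf, in case it helps with reproducing point 1. It assumes that category and cpat_coding_prob are stored as standard quoted GTF attributes in the ninth column (key "value";), which is not confirmed in this thread. The function names and the output file name are hypothetical and not part of TACO.

```python
import re

# Assumption: attributes follow the usual GTF form  key "value";
ATTR_RE = re.compile(r'(\w+) "([^"]*)"')


def parse_attributes(field):
    """Turn a GTF attribute column into a dict of key -> string value."""
    return dict(ATTR_RE.findall(field))


def suspicious_lines(lines, prob_cutoff=0.999):
    """Yield GTF lines labelled "lncrna" whose cpat_coding_prob exceeds the cutoff."""
    for line in lines:
        if line.startswith("#") or not line.strip():
            continue  # skip comments and blank lines
        cols = line.rstrip("\n").split("\t")
        if len(cols) < 9:
            continue  # not a complete GTF record
        attrs = parse_attributes(cols[8])
        try:
            prob = float(attrs.get("cpat_coding_prob", "nan"))
        except ValueError:
            continue  # value present but not numeric
        # A missing probability becomes NaN, which never passes the comparison.
        if attrs.get("category") == "lncrna" and prob > prob_cutoff:
            yield line


# Example usage (file names are placeholders):
# with open("assembly.refcomp.gtf") as src, open("snippet.gtf", "w") as dst:
#     dst.writelines(suspicious_lines(src))
```

The same filter could be adapted to point 2 by comparing ref_gene_type against category_relative_detail, again assuming both are present as attributes.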