dieterich-lab / DCC

DCC uses output from the STAR read mapper to systematically detect back-splice junctions in next-generation sequencing data. DCC applies a series of filters and integrates data across replicate sets to arrive at a precise list of circRNA candidates.
https://dieterichlab.org/software/
GNU General Public License v3.0
36 stars 20 forks source link

Annotation in develop branch produces different results #12

Closed tjakobi closed 8 years ago

tjakobi commented 8 years ago

While the positions of detected CircRNAs are the same between master and develop branch, the annotation itself differs.

Example:

631069 631291 RP5-857K21.6 0 + intron-exon exon,gene,transcript 1 631069 631369 RP5-857K21.6 0 + intron-exon exon,gene,transcript

1 631069 631291 RP5-857K21.6 0 + intron-exon transcript,gene,exon 1 631069 631369 RP5-857K21.6 0 + intron-exon transcript,gene,exon Additionally to the cases with different annotation order in the type2 column, there are others cases:

1 10095461 10105744 UBE4B 2 + exon-exon CDS,exon,gene,transcript 1 10117459 10119613 UBE4B 2 + exon-exon CDS,exon,gene,transcript

vs.

1 10095461 10105744 UBE4B 2 + exon-intron gene,transcript 1 10117459 10119613 UBE4B 2 + exon-intron gene,transcript

And also cases where the type is different:

1 24514314 24514567 RCAN3 2 + exon-exon CDS,UTR,exon,gene,start_codon,transcript 1 24514314 24531391 RCAN3 2 + exon-exon CDS,UTR,exon,gene,start_codon,transcript

vs.

1 24514314 24514567 RCAN3 2 + exon-intron gene,transcript 1 24514314 24531391 RCAN3 2 + exon-intron gene,transcript

Type exon-exon vs. exon-intron.