I have bam files, which were produced with an annotation file, in which chromosome identifies are 1,2,..., X, Y, M, so, not chr1, chr2,..., as in the GENCODE annotation.
I changed the identifiers in the GENCODE annotation file accordingly, then ran collapse_annotation.py on it. No problems occured, a correct collapsed file was produced. But then, when I run rnaseqc, I get the error:
I guess, my modification of annotation file was not correct, because with Ensemble annotation file, which also has 1,2,..., X, Y chromosome identifiers, everything worked.
Sorry
Hi!
I have bam files, which were produced with an annotation file, in which chromosome identifies are 1,2,..., X, Y, M, so, not chr1, chr2,..., as in the GENCODE annotation. I changed the identifiers in the GENCODE annotation file accordingly, then ran collapse_annotation.py on it. No problems occured, a correct collapsed file was produced. But then, when I run rnaseqc, I get the error:
Failed to parse the GTF: Exon missing exon_id and gene_id fields: 1 HAVANA exon 11869 12227 . + . "gene_id ""ENSG00000223972.4""; transcript_id ""ENSG00000223972.4""; gene_type ""pseudogene""; gene_status ""KNOWN""; gene_name ""DDX11L1""; transcript_type ""pseudogene""; transcript_status ""KNOWN""; transcript_name ""DDX11L1""; level 2; havana_gene ""OTTHUMG00000000961.2"";" exon_id "ENSG00000223972.4_1; exon_number 1";
With usual annotation file, rnaseqc runs correctly. I used rnaseqc.v2.4.2.linux
Best, Aleksei