openvax / pyensembl

Python interface to access reference genome features (such as genes, transcripts, and exons) from Ensembl
Apache License 2.0

Error when checking that transcript is complete in ensembl69 #231

Closed. scottdbrown closed this issue 4 years ago

scottdbrown commented 4 years ago

Using Ensembl release 69, I ran into an issue: some transcripts have an annotated start_codon that spans only two positions (for example, ENST00000543092).

Because of this, running .complete() fails: self.coding_sequence raises an error while determining the _codon_positions: ValueError: Expected 3 positions for start_codon of ENST00000543092 but got 2

It seems that more recent releases of Ensembl do not have this issue (they no longer list a start_codon for these cases).
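One possible workaround on release 69 is to treat such transcripts as incomplete instead of letting the exception propagate. The following is only a minimal sketch, not pyensembl's own fix: `safe_is_complete` is a hypothetical helper, and `StubTranscript` is a hypothetical stand-in that mimics the reported error so the example runs without downloading Ensembl data. It assumes the transcript object exposes `complete` either as a property or as a method.

```python
class StubTranscript:
    """Hypothetical stand-in for a pyensembl Transcript (illustration only)."""

    def __init__(self, transcript_id, n_start_codon_positions):
        self.transcript_id = transcript_id
        self.n_start_codon_positions = n_start_codon_positions

    @property
    def complete(self):
        # Mimics the failure reported above for a truncated start codon.
        if self.n_start_codon_positions != 3:
            raise ValueError(
                "Expected 3 positions for start_codon of %s but got %d"
                % (self.transcript_id, self.n_start_codon_positions)
            )
        return True


def safe_is_complete(transcript):
    """Return the completeness flag, treating a ValueError as 'not complete'."""
    try:
        value = transcript.complete
        # Accept both a property and a method named `complete`.
        if callable(value):
            value = value()
        return bool(value)
    except ValueError:
        # Malformed codon annotation: report the transcript as incomplete.
        return False


truncated = StubTranscript("ENST00000543092", n_start_codon_positions=2)
well_formed = StubTranscript("EXAMPLE_OK", n_start_codon_positions=3)
print(safe_is_complete(truncated))
print(safe_is_complete(well_formed))
```

Catching only ValueError keeps unrelated failures visible. The trade-off is that a transcript with a malformed annotation is silently reported as incomplete, so logging the transcript ID in the except branch may be worthwhile.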

Using Python 3.7.3 and Isovar 1.0.10.