fedarko / strainFlye

Pipeline for analyzing (rare) mutations in metagenome-assembled genomes
BSD 3-Clause "New" or "Revised" License
8 stars 1 forks source link

Support circular GFF3 features in hotspot detection? #31

Open fedarko opened 2 years ago

fedarko commented 2 years ago

Version 1.26 of the GFF3 spec mentions that circular features can be encoded in a GFF3 file by setting the end coordinate of such a feature to a position greater than the rightmost position in a contig.

We currently don't support this sort of feature in our code, and will raise an error if we see something like this. FWIW, prodigal's gene predictions on the SheepGut dataset don't have this problem at all (although this might be a result of us using the -c option).

Anyway, handling this sort of case is definitely feasible, but will require a bit of extra work. So I'm putting this issue on the backburner for now, in lieu of more important issues; I can address this if there is desire for it.

Things to do to implement support for circular features