gpertea / gclib

GCLib - Genomic C++ library of reusable code for bioinformatics projects
Other
33 stars 13 forks source link

Adding support for CDS classification codes #12

Closed alevar closed 2 months ago

alevar commented 2 months ago

Changes are intended to support the introduction of a new flag '--cds-match' and new codes ':', '_' in gffcompare. New gffcomapre logic is intended to perform validation of CDS chains in addition to the exon chains. Current implementation only addresses CDS matching for '=' and '~' cases. This feature is intended for use with polycistronic transcripts and for comparing annotations with similar exon structures but differing ORFs.

The fix introduces two new classification codes ':' and '_' which replace '=' and '~' when no matching CDS is found.