gpertea / gffcompare

classify, merge, tracking and annotation of GFF files by comparing to a reference annotation GFF
MIT License
198 stars 32 forks source link

replace '.' class code in .tracking and combined files with a concatenation of participant codes #59

Open gpertea opened 4 years ago

gpertea commented 4 years ago

The '.' class code shown in the .tracking and .combined.gtf files for multiple input files is a result of the transcripts merged there across multiple samples having different relationships (class code) with the reference transcript. Currently gffcompare uses the uninformative "confused" code . but it would be much more informative if a list of the actual codes encountered there should be used instead, e.g. =c if some of the transcripts on that line have code = and others have code c.

The list should be sorted by frequency -- the most frequent class code should be placed first in that concatenated list.