thkuo / seq2geno

Computational pipeline for genomic features of bacterial population
GNU General Public License v3.0
0 stars 0 forks source link

id-dependent methods fail in case of duplicated ids in gbk #42

Open thkuo opened 6 years ago

thkuo commented 6 years ago

Because the current gbk (used previously by Ariane) includes duplicated gene ids (PA2570.1), which were likely subunits or paralogues, the counting script art2genecount.pl wasn't able to correctly count the read numbers. Acceptable solution: the locus ids in the annotation file should all be unique Be aware of them when developing the ng version