wurmlab / flo

Same species annotation lift over pipeline.
95 stars 28 forks source link

collision between gene names using a merge of two genome GFF files #14

Closed splaisan closed 6 years ago

splaisan commented 7 years ago

I work on a hybrid yeast made of two known referenced yeasts. I constructed the artificial assembly by merging the fasta files and did the same with the two GFF files from NCBI. When I run flo with that gff and a denovo assembly of the hybrid genome, it dies at the GFF cleaning step producing errors because the two stains have genes with different geneIDs gbIDs but identical canonical gene names.

I thought I was clever by replacing th emerged GFF by the two GFF in the yaml but it fails also (but not due to name)

Is the best solution adding both GFF to the yaml and merging the cleaned results back after lifting?

Thanks in advance, Stephane

PLEASE DELETE this issue: