refresh-bio / agc

Assembled Genomes Compressor
MIT License
153 stars 13 forks source link

Error: Pair sample_name:contig_name when using bgzip compressed fasta file as input. #13

Open zhengxinchang opened 9 months ago

zhengxinchang commented 9 months ago

Dear developers,

I am writing to report a potential bug regarding the duplication error of a pair of the sample name and the contig name.

BGZIP-compressed FASTA files will encounter this error, but the FLAT format does not. The combination of the sample name and contig name is very long in my case. So, I suspect that AGC might trim the string when handling them for some reason.

Xinchang