I noticed some buggy behavior in the seqtk (Version: 1.4-r122) subseq command: it mishandles sequences whose contig headers in the list contain a hyphen (-).
As you can see below, contigs/sequences with a hyphen end up with modified headers and a length of a single amino acid:
The same thing happens when just using the contig headers as they are from the FASTA file:
Removing the extra info from the contig headers gives the correct behavior:
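For anyone hitting the same problem, here is a minimal sketch of that workaround in Python. It assumes the "extra info" is everything after the first whitespace in each header, which may not match every file. The helper names `clean_name` and `clean_name_list` are hypothetical and not part of seqtk.

```python
def clean_name(header_line: str) -> str:
    """Return the first whitespace-delimited token of a FASTA header or list line.

    Assumption: everything after the first whitespace is "extra info"
    that can be dropped. A leading '>' is removed if present.
    """
    stripped = header_line.strip().lstrip(">")
    return stripped.split()[0] if stripped else ""


def clean_name_list(lines):
    """Clean every non-empty line and return the resulting names in order."""
    names = []
    for line in lines:
        name = clean_name(line)
        if name:  # skip blank lines
            names.append(name)
    return names
```

The cleaned names can then be written one per line to the list file passed to `seqtk subseq`. Hyphens inside the name itself are left untouched, so this only helps if the problem comes from the text after the name.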
Hope this helps prevent issues for other users. Best, Francisco