fedarko / strainFlye

Pipeline for analyzing (rare) mutations in metagenome-assembled genomes
BSD 3-Clause "New" or "Revised" License
8 stars 1 forks source link

Ensure that contig names can be used within filenames #57

Open fedarko opened 2 years ago

fedarko commented 2 years ago

There are a few commands that include contig names in filenames -- right now it's just phasing commands:

In most cases, contig names should be restricted to [a-zA-Z0-9_-.], and should thus be fine as filenames. But I'm sure eventually we'll start seeing weird contig names with spaces or other characters that will mess this up.

I'm not sure it's worth trying to anticipate and address these problems in advance (we could modify the FASTA-loading parts of the code to do some validation on contig names), but I'm making this issue just to catalog what parts of the code this problem touches at the moment.