harvardinformatics / snpArcher

Snakemake workflow for highly parallel variant calling designed for ease-of-use in non-model organisms.
MIT License

Make it easier for users to supply own reads and reference #21

Closed: cademirch closed this issue 2 years ago

cademirch commented 2 years ago

Currently the workflow is set up such that users supply a CSV sample sheet specifying the samples, their associated SRA run accessions (SRRs), and the reference. This makes running the workflow on public datasets very easy. However, running the workflow on your own data is not straightforward. I've gotten it to run on my own data, but only by organizing the reads into the directory structure Snakemake would have created had it downloaded the reads for me, so that the workflow would run everything after the download step.
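
For context, the sample sheet currently looks roughly like this (column names here are just for illustration, not the exact schema):

```
sample,run_accession,ref_accession
sample_A,SRR1234567,GCA_000000000.1
sample_B,SRR7654321,GCA_000000000.1
```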

I still haven't come up with a solution to this that I like. Would appreciate ideas/thoughts.

tsackton commented 2 years ago

One solution here may be a helper script to organize everything properly. This is what we ended up doing with the old Python-based pipeline; it is a little clunky but not unmanageable. Ideally the helper script would also create the proper sample sheets for the user, perhaps taking fixed values as command line parameters, or reading from a very simple config file.
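Something along these lines is what I have in mind. Everything here is a placeholder sketch: the expected directory layout, column names, and flags are made up to illustrate the idea, not an existing snpArcher tool.

```python
#!/usr/bin/env python3
"""Hypothetical helper: symlink local fastqs into the layout the workflow expects
and write a sample sheet. Paths and column names are illustrative assumptions."""
import argparse
import csv
from pathlib import Path

def main():
    p = argparse.ArgumentParser()
    p.add_argument("reads_dir", type=Path, help="directory of *_1.fastq.gz / *_2.fastq.gz files")
    p.add_argument("--ref", required=True, help="path to a local reference fasta")
    p.add_argument("--outdir", type=Path, default=Path("results/fastq"))
    p.add_argument("--sheet", type=Path, default=Path("config/samples.csv"))
    args = p.parse_args()

    args.outdir.mkdir(parents=True, exist_ok=True)
    args.sheet.parent.mkdir(parents=True, exist_ok=True)

    rows = []
    for r1 in sorted(args.reads_dir.glob("*_1.fastq.gz")):
        sample = r1.name.replace("_1.fastq.gz", "")
        r2 = r1.with_name(f"{sample}_2.fastq.gz")
        # Mimic the directory structure the download step would have created.
        for src in (r1, r2):
            dest = args.outdir / src.name
            if not dest.exists():
                dest.symlink_to(src.resolve())
        rows.append({"sample": sample, "fq1": str(r1), "fq2": str(r2), "refPath": args.ref})

    # Emit a sample sheet the workflow can consume directly.
    with open(args.sheet, "w", newline="") as fh:
        writer = csv.DictWriter(fh, fieldnames=["sample", "fq1", "fq2", "refPath"])
        writer.writeheader()
        writer.writerows(rows)

if __name__ == "__main__":
    main()
```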

An alternate solution would be to add some if-then logic to the Snakemake pipeline: if the fastq files are specified as a full path instead of an accession, use the files at that path, and if the genome is specified as a full path rather than an accession, use that as the reference.
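A rough sketch of the if-then logic for the reads, using a Snakemake input function. The column names ("fq1", "fq2", "Run") and the download rule's output paths are assumptions for illustration, not the pipeline's actual layout.

```python
import os
import pandas as pd

# Assumed sample sheet with optional local-path columns alongside the accession.
samples = pd.read_csv("config/samples.csv", index_col="sample")

def get_fastq(wildcards):
    row = samples.loc[wildcards.sample]
    # If the sheet gives a local path that exists, use it directly...
    if isinstance(row.get("fq1"), str) and os.path.exists(row["fq1"]):
        return {"r1": row["fq1"], "r2": row["fq2"]}
    # ...otherwise fall back to the files the SRA download rule would produce.
    return {
        "r1": f"results/fastq/{row['Run']}_1.fastq.gz",
        "r2": f"results/fastq/{row['Run']}_2.fastq.gz",
    }

rule trim_reads:
    input:
        unpack(get_fastq)
    output:
        r1="results/trimmed/{sample}_1.fastq.gz",
        r2="results/trimmed/{sample}_2.fastq.gz",
    shell:
        "fastp -i {input.r1} -I {input.r2} -o {output.r1} -O {output.r2}"
```

The same pattern would apply to the reference: an input function that returns the user-supplied fasta path when present and the downloaded genome otherwise.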

Not sure which would be easier for the user and/or easier to code and maintain.