This PR is the first PR for this repository. It does two things:
1. Starts a README that describes the goals of the repository, gives background on the code in this repo, explains how to run the workflow, and leaves placeholders for future information (e.g. visualization and notebooks).
2. Includes a Snakefile and associated files that make up a snakemake workflow for sourmash commands (see the README for motivations):
a. Snakefile: snakemake workflow that coordinates the execution of sourmash commands on metagenome assemblies. I have run this workflow and can confirm it runs correctly :) Eventually, I will add notebooks that will visualize the output of the workflow, but I wanted to have this portion reviewed before dumping a bunch more code.
b. environment.yml: specifies the run environment for the workflow. See README.md for more information.
c. envs/*yml: environments created and managed by the snakefile (see the conda: directive in each rule to know which environment is used by each step of the workflow).
d. scripts/: folder for auxiliary scripts executed by the snakemake workflow. In this case, it only includes sig_to_csv.py, a python script to convert a sourmash sketch into a csv file.
e. inputs/metadata.csv: metadata file encoding sample names. Used by the snakefile to determine file prefixes.
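
For reviewers who want a feel for how inputs/metadata.csv drives the workflow, here is a minimal sketch of the pattern in plain Python. It is not the code from the Snakefile: the column name `sample`, the output path pattern, and the accession-style sample names are all assumptions made up for illustration.

```python
import csv
import io


def read_sample_names(handle, column="sample"):
    """Return unique sample names from a metadata CSV, keeping file order.

    The column name "sample" is a hypothetical placeholder; the real
    metadata file may use a different header.
    """
    reader = csv.DictReader(handle)
    names = []
    for row in reader:
        name = row[column].strip()
        # Skip blank cells and repeated sample names.
        if name and name not in names:
            names.append(name)
    return names


def expected_outputs(samples, pattern="outputs/sourmash_sketch/{sample}.sig"):
    """Expand a path pattern once per sample (the path itself is hypothetical)."""
    return [pattern.format(sample=s) for s in samples]


# Stand-in for open("inputs/metadata.csv"), so the sketch runs on its own.
example = io.StringIO("sample\nSRR0000001\nSRR0000002\n")
SAMPLES = read_sample_names(example)
print(expected_outputs(SAMPLES))
```

In a Snakefile, a list like `SAMPLES` would typically feed `expand()` in the `all` rule, so each sample name becomes the prefix of that sample's output files.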