zAMP is a Snakemake cli, written in Snaketool, for amplicon metagenomics anlysis. It includes state-of-the-art tools for performant and reproducibile analysis of 16S rRNA or ITS Illumina paired-end reads.
Starting from local fastq or SRA reads zAMP performs reads QC, ASV inference, taxonomic assignments and basic visualization plots. In addition, zAMP enables training taxonomic classifiers on specific primer-amplified regions in popular databases like greengenes2, SILVA and UNITE to increase sensitivity.
git clone https://github.com/metagenlab/zAMP.git
pip install -e zAMP
Greengenes2
wget http://ftp.microbio.me/greengenes_release/current/2022.10.backbone.full-length.fna.qza
wget http://ftp.microbio.me/greengenes_release/current/2022.10.backbone.tax.qza
Export qza with qiime2 export
docker run -t -i -v $(pwd):/data quay.io/qiime2/tiny:2024.5 \
qiime tools export \
--input-path 2022.10.backbone.full-length.fna.qza \
--output-path greengenes2
docker run -t -i -v $(pwd):/data quay.io/qiime2/tiny:2024.5 \
qiime tools export \
--input-path 2022.10.backbone.tax.qza --output-path greengenes2
Prepare database
zamp db --fasta greengenes2/dna-sequences.fasta \
--taxonomy greengenes2/taxonomy.tsv --name greengenes2 \
--fw-primer CCTACGGGNGGCWGCAG --rv-primer GACTACHVGGGTATCTAATCC \
-o greengenes2
zamp run -i zamp/data/sra-samples.tsv \
-db greengenes2 \
--fw-primer CCTACGGGNGGCWGCAG --rv-primer GACTACHVGGGTATCTAATCC
$ zamp -h
Usage: zamp [OPTIONS] COMMAND [ARGS]...
Snakemake pipeline designed for convenient, reproducible and scalable
amplicon-based metagenomics
For more options, run: zamp command --help
Options:
-v, --version Show the version and exit.
-h, --help Show this message and exit.
Commands:
db Prepare database files for zAMP
run Run zAMP
citation Print zAMP and tools citations