Improve the quality of metagenome-assembled genomes by identifying and removing contaminant sequences with an easy-to-use and modular command-line interface.
You can find MAGpurify2's full documentation at its website.
If you use MAGpurify2 in your work, please cite the manuscript where it was first described:
Camargo, Antonio P., et al. "Plant microbiomes harbor potential to promote nutrient turnover in impoverished substrates of a Brazilian biodiversity hotspot." The ISME Journal 17.3 (2023): 354-370.
Install MAGpurify2:
pip install magpurify2
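To keep MAGpurify2's dependencies separate from other Python packages, the install can also be done inside a virtual environment. The following is a minimal sketch, assuming python3 with the standard venv module is available. The directory name magpurify2-env and the helper functions install_in_venv and have_cmd are hypothetical and not part of MAGpurify2.

```shell
#!/bin/sh
# Sketch: install MAGpurify2 in an isolated virtual environment.
# "magpurify2-env" is a hypothetical directory name; any path works.

install_in_venv() {
    env_dir="${1:-magpurify2-env}"
    python3 -m venv "$env_dir" || return 1
    # Activate the environment for the current shell session.
    . "$env_dir/bin/activate"
    pip install magpurify2
}

# Report whether a command is available on PATH.
have_cmd() {
    command -v "$1" > /dev/null 2>&1
}

# Usage (nothing runs until these functions are called):
#   install_in_venv && have_cmd magpurify2 && echo "magpurify2 is on PATH"
```

The functions only define behavior; call install_in_venv explicitly to perform the install.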
Download test data:
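# fileId must be set to the ID of the test-data file before running these commands.
fileName="magpurify2_test_data.tar.gz"
curl -sc /tmp/cookie "${fileId}" > /dev/null
code="$(awk '/_warning_/ {print $NF}' /tmp/cookie)"
curl -Lb /tmp/cookie "${code}&id=${fileId}" -o "${fileName}"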
tar zxfv magpurify2_test_data.tar.gz
Download database:
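# fileId must be set to the ID of the database file before running these commands.
fileName="magpurify2DB.v1.0.tar.gz"
curl -sc /tmp/cookie "${fileId}" > /dev/null
code="$(awk '/_warning_/ {print $NF}' /tmp/cookie)"
curl -Lb /tmp/cookie "${code}&id=${fileId}" -o "${fileName}"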
tar zxfv magpurify2DB.v1.0.tar.gz
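Before running the pipeline, it can help to confirm that both archives extracted where the commands expect them. The sketch below assumes the archives unpack into test_data/genomes, test_data/bam_files, and magpurify2DB, which are the paths used by the pipeline commands. The check_inputs helper is hypothetical.

```shell
#!/bin/sh
# Sketch: confirm the extracted inputs exist before running the pipeline.
# The directory names are assumed from the paths used by the pipeline
# commands; check_inputs is a hypothetical helper, not part of MAGpurify2.

check_inputs() {
    base="${1:-.}"
    status=0
    for dir in test_data/genomes test_data/bam_files magpurify2DB; do
        if [ ! -d "$base/$dir" ]; then
            echo "missing directory: $base/$dir" >&2
            status=1
        fi
    done
    return "$status"
}

# Usage: check_inputs            (checks the current directory)
#        check_inputs /some/path (checks another location)
```

The function returns a non-zero status and names each missing directory on standard error, so it can gate the next step with `check_inputs && ...`.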
Execute the pipeline:
magpurify2 composition test_data/genomes/* output
magpurify2 coverage test_data/genomes/* output --bam_files test_data/bam_files/*
magpurify2 taxonomy test_data/genomes/* output magpurify2DB
magpurify2 filter test_data/genomes/* output filtered_genomes
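The four commands above can be chained so that a failure in one step stops the run. This is a minimal sketch that reuses exactly the commands listed above. The run_step and run_pipeline functions and the DRY_RUN variable are hypothetical conveniences, not MAGpurify2 features.

```shell
#!/bin/sh
# Sketch: run the four MAGpurify2 steps in order, stopping at the first failure.
# run_step, run_pipeline, and DRY_RUN are hypothetical; the magpurify2
# commands themselves are the ones from the section above.

run_step() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        # Print the command instead of executing it.
        echo "$*"
    else
        "$@"
    fi
}

run_pipeline() {
    genomes="test_data/genomes"
    outdir="output"
    run_step magpurify2 composition "$genomes"/* "$outdir" &&
    run_step magpurify2 coverage "$genomes"/* "$outdir" --bam_files test_data/bam_files/* &&
    run_step magpurify2 taxonomy "$genomes"/* "$outdir" magpurify2DB &&
    run_step magpurify2 filter "$genomes"/* "$outdir" filtered_genomes
}

# Usage: run_pipeline
```

Setting DRY_RUN=1 before calling run_pipeline prints each command without executing it, which is a quick way to check the expanded file lists.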