BackofenLab / CRISPRloci

13 stars 6 forks source link

CRISPRloci

CRISPRloci provides an automated and comprehensive in silico characterization of CRISPR-Cas system on bacterial and archaeal genomes. It is a full suite for CRISPR locus characteriztion that includes CRISPR array orientation, detection of conserved leaders, cas gene annotation and subtype classification.

The web server interface of CRISPRloci is freely available at: rna.informatik.uni-freiburg.de/trunk/CRISPRloci

GitHub Logo

Citation

If you use CRISPRloci, please cite our papers:

Installation and requirements

CRISPRloci_standalone.py has been tested with Python 3.7 To run it, we recommend installing the same library versions we used. Since we exported our classifiers following the model persistence guideline from scikit-learn, it is not guaranteed that they will work properly if loaded using other Python and/or library versions. For such, we recommend the use of our docker image or a conda virtual environment. They make it easy to install the correct Python and library dependencies without affecting the whole operating system (see below).

First step: download the last version of the tool and extract it

wget https://github.com/BackofenLab/CRISPRloci/archive/1.0.0.tar.gz
tar -xzf 1.0.0.tar.gz

Second step: download the Hidden Markov (HMM) and Machine Learning (ML) models

Due to GitHub's file size constraints, we made our HMM and ML models available in Google Drive. You can download them here and here. Save both tar.gz files inside CRISPRcasIdentifier's directory. It is not necessary to extract them, since the tool will do that the first time it is run.

Third step: download the Hidden Markov (HMM) and Machine Learning (ML) models

We made our HMM and ML models available in Google Drive. You can download them from the following links:

Save all tar.gz files inside Casboundary's folder. It is not necessary to extract them, since the tool will do that the first time it is run.

Fourth step (conda)

First we install Miniconda for python 3. Miniconda can be downloaded from here: miniconda.

Install Miniconda.

wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh chmod +x Miniconda3-latest-Linux-x86_64.sh ./Miniconda3-latest-Linux-x86_64.sh

Create and activate environment for CRISPRloci.

conda env create -f CRISPRloci-env.yml -n CRISPRloci-env
conda activate CRISPRloci-env

After using CRISPRloci_standalone.py you can deactivate the environment.

conda deactivate

Quick run with the default parameters

DNA mode

In order to test the dna mode please execute the following the command

python3.7 CRISPRloci_standalone.py -f Example/NC_005230.fasta -st dna

Protein mode

In order to test the protein mode tun the following command:

python3.7 CRISPRloci_standalone.py -f Example/NC_005230_proteins.fasta -st  protein

Viral mode

In order to test the viral more execute the following command

python3.7 CRISPRloci_standalone.py -f Example/Input3.fa -st virus

DNA mode

In order to pick this mode the flag -st should be set to dna

General flags

Cas Protein search related flags

CRISPR array search related flag

Protein mode

In order to pick this mode the flag -st should be set to protein

General flags

Cas Protein search related flags

Viral mode

In order to pick this mode the flag -st should be set to virus

General flags

Search parameter