Bhattacharya-Lab / DConStruct

Hybridized distance- and contact-based hierarchical protein folding
GNU General Public License v3.0
2 stars 0 forks source link
abinitio-simulations protein-folding protein-modeling protein-structure-prediction

DConStruct

Hybridized distance- and contact-based hierarchical protein folding

Installation

Installing DConStruct is very straightforward. The following instructions should work for 64-bit Linux system:

That's it! DConStruct is ready to be used.

Usage

To see the usage instructions, run python DConStruct.py -h


*************************************************************************
*                            DConStruct                                 *
*  Hybridized distance- and contact-based hierarchical protein folding  *
*  For comments, please email to bhattacharyad@auburn.edu               *
*************************************************************************

Usage: DConStruct.py [options]

Options:
  -h, --help  show this help message and exit
  -r RR       rr file in CASP format containing the contact map (mandatory)
  -a AA       fasta file containing the amino acid sequence (mandatory)
  -s SS       secondary structure file (mandatory)
  -m M        MODELLER program path that contains modpy.sh script (mandatory)
  -o OUTPUT   existing output directory path (mandatory)
  -n NO       positive integer to be used as seed (optional); default 7
  -c CTYPE    contact type ca or cb (optional); default cb
  -x L        top xL contacts, where L is the sequence length (optional);
              default 8

File formats and parameters

To run DConStruct with predicted distance-based information, we provide a helper script that can generate distance-based 3-class contact (rr file) from distance histogram (distogram) using rawdistpred.current generated by DMPfold. The script is available here.

Test DConStruct

We give an example of running DConStruct on CASP13 FM target T0968s2.

Create an output directory mkdir output/.

Run python DConStruct.py -r examples/input/T0968s2.rr -a examples/input/T0968s2.fasta -s examples/input/T0968s2.ss -o output/ -c cb -x 8 -m your/modeller/path

Top predicted model will be generated at output/T0968s2_model1.pdb. The predicted 3D model is given here and the output screen should look like this.

DConStruct is reasonably fast. Depending on the sequence length of the target protein, DConStruct takes only a few minutes to a few hours to complete.

Data

  1. Download input data for running DConStruct on FRAGFOLD 150 dataset, CASP12 and CASP13 FM dataset, 510 Membrane protein dataset, and 15 EVfold dataset
  2. Download DConStruct predicted models for FRAGFOLD 150 dataset, CASP12 and CASP13 FM dataset, 510 Membrane protein dataset, and 15 EVfold dataset

Cite

If you find DConStruct useful, please cite our PLOS Computational Biology paper.