tubiana / protocolGromacs

Automatic gromacs protocol from preparation to production with ligand parametrization through
MIT License
59 stars 14 forks source link

DOI

Automatic Gromacs Workflow Script

Author: Thibault Tubiana, PhD
Please read before using this script.

Description

This script is made to facilitate the preparation and production of protein and protein/ligand, MD.
It follows the procedure described for teaching I made at the University of Bergen. You can find lectures content on this page http://tubiana.me/teaching/kjem220-molecular-modelling/ or the pdf describing all the steps on this script here: http://tubiana.me/teaching_files/biocat2020/Tutorial_Gromacs-2019.pdf Fundamental analysis is also generated with gromacs tools (temperature/pressure/rmsd/rmsf/...), and the production trajectories are also cleaned with trjconv (imaging/protein centred/water stripped), but all the original trajectories are kept.
Feel free to make other analysis of course, like trajectory clustering with TTClust https://github.com/tubiana/TTClust πŸ˜‡

Disclamer

Dependencies

Here's a unique command line to create a environment with every depencencies
conda create -n gmx -c conda-forge -c salilab acpype dssp
you can activate the environment with conda activate gmx

How to

  1. Make sure you have all the dependencies
    1. If you have a protein-ligand system, make sure acpype is installed (see parameters)
    2. Gromacs
    3. (optional) DSSP version 3
  2. Clone this repository with the command git clone https://github.com/tubiana/protocolGromacs.git
  3. Put your PDB in the repository
  4. Make the change you need in runGromacs.sh (See parameters)
  5. run the script with bash runGromacs.sh

Parameters

You have to make some changes in the script file (runGromacs).

Workflow

Here's a picture describing the workflow in this script, but you can find more information on each step on my tutorial http://tubiana.me/teaching_files/biocat2020/Tutorial_Gromacs-2019.pdf. You can, of course, modify my script as you want :-)

folder structure

Here's a description of the folder structure after a simulation job:

|-- .                             #--> repo folder, the script, the initial structure and topologie files
    |-- param                     #--> only if ligand is present, will contain receptor and ligand parameters
        |-- receptor              #--> receptor structure and topology
        |-- ligand                #--> receptor topology
            |-- ligand.acpype     #--> ligand topology
    |-- mdp                       #--> original mdp parameters
    |-- replica_X                 #--> simulation for replica number X (if 3 replica, then 3 folders)
        |-- graph                 #--> All the output graph are saved here (rmsd,rmsf,energy.....)
        |-- gro                   #--> Some output structures from MD are saved here
        |-- mdp                   #--> copy of previous mdp folder
        |-- results               #--> contains the MD
            |-- mini              #--> minimisation MD files
            |-- nvt               #--> heationg MD files
            |-- npt               #--> equilibration MD files
            |-- prod              #--> production MD files

Last thing...

Have fun with MD and send me a mail if or open an issue if you have any problems, or just if you used this script and want to thanks me, I will be please to know that it was useful for someone πŸ™‚

Thibault Tubiana.