willsheffler / rpxdock

protein docking stuff
Other
36 stars 9 forks source link

RPXDock


Version 2 of the venerable tcdock and rpxdock programs in rosetta. This version will be faster, simpler, and mostly in python.

.. figure:: doc/img/general_software_structure.jpg :width: 900px :align: center :height: 400px :alt: alternate text :figclass: align-center

Read the RPXDock manuscript <https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1010680>_.

Running Rpxdock ############### Here is a simple example, tested with python 3.12. The first time you run rpxdock, it will build some C++ files, this takes a few minutes. ::

virtualenv rpxdock_venv source ./rpxdock_venv/bin/activate # if not bash, use appropriate script in this dir

pip install git+https://github.com/willsheffler/rpxdock

python -m pytest --pyargs rpxdock # optional

dock monomers into cyclic oligomers:

python -mrpxdock --architecture C3 --inputs1 rpxdock/data/pdb/C3_1na0-1_1.pdb.gz --dump_pdbs --hscore_data_dir /data/sheffler/data/rpx/hscore/willsheffler/ python -mrpxdock --architecture C17 --inputs1 rpxdock/data/pdb/C3_1na0-1_1.pdb.gz --dump_pdbs --hscore_data_dir /data/sheffler/data/rpx/hscore/willsheffler/

dock cages from cyclic oligomers:

python -mrpxdock --architecture T32 --inputs1 rpxdock/data/pdb/C3_1na0-1_1.pdb.gz --inputs2 rpxdock/data/pdb/C2_REFS10_1.pdb.gz --dump_pdbs --hscore_data_dir /data/sheffler/data/rpx/hscore/willsheffler/ python -mrpxdock --architecture O32 --inputs1 rpxdock/data/pdb/C3_1na0-1_1.pdb.gz --inputs2 rpxdock/data/pdb/C2_REFS10_1.pdb.gz --dump_pdbs --hscore_data_dir /data/sheffler/data/rpx/hscore/willsheffler/ python -mrpxdock --architecture I32 --inputs1 rpxdock/data/pdb/C3_1na0-1_1.pdb.gz --inputs2 rpxdock/data/pdb/C2_REFS10_1.pdb.gz --dump_pdbs --hscore_data_dir /data/sheffler/data/rpx/hscore/willsheffler/

deactivate # leave rpxdock_venv rm -rf rpxdock_venv # delete rpxdock_venv

Useful scripts: ###############

Input preparation for RPXDock: ##############################

Example input pdb files are provided in tools/input/scaffolds/*.pdb. Input .pdb files are provided to the dock.py application using the --inputs1 option. The input can be a path to a single .pdb (e.g., example.pdb) file, or a path with a wildcard (e.g., /path/to/files/*.pdb) can be supplied for multiple inputs. For multicomponent docking, additional inputs can be provided using the --inputs2 and --inputs3 option as necessary. For trajectories with multiple input lists provided to --inputs[n], each object in the list will be sampled against every other object in a partner list. The results for list inputs are batched and ranked together against one another. Thus, the “top” results may not include representatives from every input .pdb. If results from every input are desired, the user can either analyze the entire output list or execute each input or pair of inputs in separate RPXDock trajectories.

To dock two distinct monomers asymmetrically or to form cyclic oligomers, monomeric building blocks should have their center of mass at [0,0,0]. RPXDock will not center the inputs by default, but the --recenter_input option can be passed to translate a monomeric building block such that its center of mass is at [0,0,0]. The final transform values reported are relative to the recentered pose, so it is recommended that inputs are pre-centered if the user plans to use these values.

To form dihedral, stacking, wallpaper, and polyhedral group symmetries such as tetrahedral, octahedral, and icosahedral architectures, the input building blocks must be cyclic oligomers. The input .pdb files must be pre-aligned such that their internal rotational symmetry axes are aligned to the Z axis and the center of mass of the oligomer should be centered at [0,0,0]. It is important to note that the input .pdb files should only contain the asymmetric unit (asu) of the cyclic oligomer rather than the full symmetric building block, as RPXDock will generate the symmetry-related chains. Currently, dihedral docking only supports one-component (i.e., homomeric) architectures; stacking supports two-component architectures; polyhedral group docking supports one-, two-, and three-component architectures; and wallpaper docking supports two- and three-component architectures.

Using Pyrosetta ###############

To run RPXDock, clone this github repo and set up a proper conda environment using the environment.yml file. Note that a user must obtain a pyrosetta license (free for non-profit users) and update the username and password fields for their pyrosetta license in the environment.yml file before creating the environment. Users may need to also install additional packages in their conda environment such as pyyaml and willutil <https://github.com/willsheffler/willutil>_ to properly build the application. To build and compile the codebase with the newly created conda environment, a user may simply run the pytest command using a gcc>9-compatible compiler. To verify that the code compiled properly, execute rpxdock/app/dock.py –-help in the new conda environment. The output should provide a list of options that are relevant for docking.