Support for mutating arbitrary sequences

The PDBMutator currently only modifies amino acid residues at the specified positions for a given Protein Data Bank (PDB) id. Rather, we should have the ability to mutate any passed in sequence or structure. First, we have to check if the value passed into the first parameter of PDBMutator().modify_residues() is either (a) valid PDB id, (b) PDB file, (c) an arbitrary structure, or (d) an arbitrary sequence.

Depending on which type is passed in, the mutator will check if it can mutate to the format the user specified. The table details which mutations are compatible with each input type.

Input type	Can mutate to
PDB ID	Primary, Tertiary
PDB/SDF file	Primary, Tertiary
Structure	Primary, Tertiary
Sequence	Primary

Note that, if only a sequence is passed in, the mutator can only modify it to the primary format type. This is because, with only the sequence of residue names, we lose 3D information about the protein. As such, no tertiary amino acid mutations can be made.

ayushkarnawat / profit

Support for mutating arbitrary sequences #7