DeepRank / pdb2sql

Fast and versatile biomolecular structure PDB file parser using SQL queries
https://pdb2sql.readthedocs.io
Apache License 2.0
24 stars 12 forks source link

add pdb matching function #84

Open LilySnow opened 2 years ago

LilySnow commented 2 years ago

pdb matching is a very useful step for pdb analysis. It would be nice if we could add this to pdb2sql.

Expected performance:

INPUT:

OUTPUT:

Ideally, we hope to separate pdb_matching into two functions (steps):

Step 1. pdb_match_chn_batch.py: match chain IDs of pdb files to ref.pdb. Output _newChnID.pdb files. Note: This step can be skipped if model.pdb files have already matched chain IDs. This step is also error-prone when multiple chains are highly similar to each other. Therefore, a human visual check is necessary.

Step 2. pdb_renum_batch.py: align and renumber pdb files to ref.pdb. Output _renum.pdb files.

There are two existing solutions:

  1. https://github.com/LilySnow/PDB-matching (python + cpp)
  2. https://github.com/DeepRank/haddock-tools/commit/ed9beee4437a58ecf9dbc7961b38a63cb5b9e282 (python, by the haddock group)

Maybe we could use these solutions as the basis?

github-actions[bot] commented 1 year ago

This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.