Open kad-ecoli opened 1 year ago
Thanks for flagging!
@nikitos9000 Let's fix this in our internal and released esmfold infer_pdb
functions simultaneously so new predictions are formatted correctly.
Unfortunately we won't be able to fix the existing predictions in the Atlas.
I have prepared a C++ program at https://github.com/pylelab/USalign/blob/master/pdbAtomName.cpp which can very quickly fix atom name for old ESM atlas pdb files. With this program, it should be pretty easy to fix all existing predictions in a few days. The program can be compiled by
git clone https://github.com/pylelab/USalign.git
cd USalign
make pdbAtomName
Bug description In all PDB files provided by ESM atlas, the atom name at column 13-16 of the ATOM record are always left justified, which is not the standard string formatting specified by the PDB format at https://www.wwpdb.org/documentation/file-format-content/format33/sect9.html#ATOM
Reproduction steps For example, the first residue of MGYP000911143359 is
Expected behavior In standard PDB format, the above residue should have been
Additional context Although the non-standard atom name justification does not affect the visualization of the PDB file, it does affect the some structure analysis tools such as US-align and REDUCE.