MDAnalysis supports a wide range of file formats for reading/writing
coordinates and topologies
(https://userguide.mdanalysis.org/stable/formats/index.html).
However, there are still some file formats that are missing - e.g.
PDBx/mmCIF, which has replaced the PDB format as the default
format for the Protein Data Bank (see Issue
#2367).
We also welcome the addition of support for any other file formats
we are currently missing.
Note that MDAnalysis can support new formats without adding code to
the main package. You can put the code ("reader" and "writer"
classes for trajectories, "parser" classes for topologies) into a MDAKit and
if you use the appropriate base classes, MDAnalysis will automagically
know how to work with the newly supported file format!
Where to start
Identify a file format currently not supported by MDAnaylsis --- if you
don't have a particular format in mind, support for PDBx/mmCIF is a
frequent request.
Create a Kit featuring a new reader/writer class, using ReaderBase/WriterBase,
for the new file format. If there are existing tools that could
aid in reading this format, your Kit could use these as dependencies.
Overview
MDAnalysis supports a wide range of file formats for reading/writing coordinates and topologies (https://userguide.mdanalysis.org/stable/formats/index.html). However, there are still some file formats that are missing - e.g. PDBx/mmCIF, which has replaced the PDB format as the default format for the Protein Data Bank (see Issue #2367). We also welcome the addition of support for any other file formats we are currently missing.
Note that MDAnalysis can support new formats without adding code to the main package. You can put the code ("reader" and "writer" classes for trajectories, "parser" classes for topologies) into a MDAKit and if you use the appropriate base classes, MDAnalysis will automagically know how to work with the newly supported file format!
Where to start
Identify a file format currently not supported by MDAnaylsis --- if you don't have a particular format in mind, support for PDBx/mmCIF is a frequent request.
(See the PDBx/mmCIF software resources, for example, py-mmcif and biopython could also be used, and there exists a Python-only implementation in mmcif_pdbx.)
ReaderBase
/WriterBase
, for the new file format. If there are existing tools that could aid in reading this format, your Kit could use these as dependencies.