Degiacomi-Lab / molearn

protein conformational spaces meet machine learning
https://degiacomi.org/software/molearn/
GNU General Public License v3.0
40 stars 11 forks source link

Clustering #19

Closed gwirn closed 3 months ago

gwirn commented 3 months ago

This adds a new class in data to prepare trajectories for training.

It first loads one or multiple trajectories and removes all atoms that are not part of a protein. In the case of multiple trajectories it joins them to one trajectory.

After that it can be used to subsample the trajectories with 3 different methods:

after that it saves a new topology file for the trajectory, the new trajectory as dcd file and a txt file where the indices of the frames of the original trajectory are saved