Closed rwxayheee closed 2 months ago
Comment about issue 1) When filtering with the option max_miss
a new bookmark, bookmarkname+_n
are made, and one bookmarkname+_union
, so the name of the bookmarks are expected. I did fix a bug in how bookmark names are handled while writing SDFs, and I also wrote better error messages.
Issue 2) I do believe the number to compare to is number of poses/results, and not ligands/input files. There might be some scientific discussion there we should take as a group, regarding ligand vs pose. Or am I misunderstanding your comment?
Issue 3) looking into it/might need to be part of scientific discussion mentioned above
Hi @maylinnp Thanks for all the work done. I really appreciate that. Yes I don't think (2) or (3) is a bug. It's more like a changed behavior because of union of interaction combinations. When ligands are selected by discrete types of interactions, only the best pose and at most one entry per ligand will be returned. When the selection is a union, the returned score may not be the best (3), and multiple entries per ligand can be returned (2). It might have something to do with how the union is computed. But if it's by design, then there's no issue with that
First of all thanks everyone for the nice work. I had some personal difficulties trying to export filtered molecules that came from a union of interaction filters (
max_miss
>0) :"( Here's a sample script I run Ringtail with:There are several things I'm a bit unsure about:
(1) Bookmark Name At this point,
produces
This is understandable behavior, although it's not the original bookmark name. But after that when I attempted to
write_molecule_sdfs
, it seemed like the bookmarks couldn't be accessed (?).Gave
(2) Ligand Counting
With
max_miss
>0, ligands may be doubly counted after the union is made.Number passing ligands in max_miss union
may be greater than the number of input files. Not sure if they should be merged (by best energies?) or kept as unique poses...(3) Energies in Result Output
Although
max_miss=4
should include all poses provided, the energy ligand in the results may not be the best Free Energy of Binding observed. For example for 'Val_PDBQT__ZINC00174720_s3', the best in DLG is -6.34. Is this expected? Or should we interpret these filtered results in a different way (they are no longer under a unique ligand name, but belong to a cluster that's defined by a certain interaction pattern)?