Open baoilleach opened 1 month ago
On Oct 17, 2024, at 19:16, baoilleach @.***> wrote:
I would like these two molecules to be identified as double-cut matched pairs (e.g. with exocyclic fragmentation) ... I'm hoping there's a secret minsize somewhere I can change from 1 to 0, or is support for this likely to require much greater changes?
What a neat case I never considered! No, there's nothing like that option in mmpdb.
I think the quickest way to get something to work is to process the fragdb directly, find all of the 1-cuts, and synthesize the corresponding 2-cut versions. You'll have to ensure the constant_smiles is canonical, which can be approximated by checking if they are different, then testing if X.Y or Y.X is in the database already, and if not picking a preferred order.
This sort of local canonicalization wouldn't work to merge multiple fragdbs, nor as a quick fix to fragment_algorithm.py, but it might get you want you want.
-- Andrew @.***
I would like these two molecules to be identified as double-cut matched pairs (e.g. with exocyclic fragmentation):
The larger molecule has
[*:1]C[*:2]
as the linker, which is fine:The smaller molecule should have
[*:1][*:2]
as the linker, but nothing is found:I'm hoping there's a secret minsize somewhere I can change from 1 to 0, or is support for this likely to require much greater changes?