Open Lyu6PosHao opened 6 months ago
Hi, thanks for your interest in our work! I have updated the repo with the ability to process non-connected molecules on inference (commit 507c8f9 and 8c4b7b4). The basic logic is to split the non-connected smiles with '.' and treat each connected subgraph separately. However, I noticed some issues which should be taken care of:
O=C1O[C@H]([C@H](O)CO)C([O-])=C1O.[Na+]
might become O=C1O[C@H]([C@H](O)CO)CO)=C1O.[Na+]
after reconstruction from the molecule object to the SMILES.Thanks a lot!
Great work! But I have some questions:
In ChEBI, some molecules are non-connected, such as
O=C1O[C@H]([C@H](O)CO)C([O-])=C1O.[Na+]
where Na+ is an isolated ion.So I wonder if PS is able to deal with non-connected graph? I tried it and found an error when tokenizing. Thanks.