greglandrum / rdkit-blog

RDKit blog
https://greglandrum.github.io/rdkit-blog/
7 stars 2 forks source link

rdkit-blog/posts/2024-01-11-using-abbreviations #21

Open utterances-bot opened 10 months ago

utterances-bot commented 10 months ago

RDKit blog - Using abbreviations in the RDKit

Making more compact structure drawings.

https://greglandrum.github.io/rdkit-blog/posts/2024-01-11-using-abbreviations.html

baoilleach commented 10 months ago

Just a note that the 5th image has the CO2H abbreviation 'the wrong way around' but it's correct in other earlier depictions. Maybe something to look into.

greglandrum commented 10 months ago

Just a note that the 5th image has the CO2H abbreviation 'the wrong way around' but it's correct in other earlier depictions. Maybe something to look into.

Nice catch. The abbreviations code sets an additional property on the atoms which is used when molecules are drawn with the bond coming from the right. This information is not present in the CXSMILES, so the drawing afterwards can't use it.

I think that's going to be non-trivial to address, but at the very least I can update the text in the blog post to make it clear that things aren't actually being "round tripped".