rapodaca / dialect

Documenting a subset of the SMILES language.
MIT License

Semantics of stereocenters with undefined configuration #32

Closed rapodaca closed 2 years ago

rapodaca commented 2 years ago

How should a stereocenter with undefined configuration be interpreted?

No SMILES or OpenSMILES documentation explains this point. If it is not addressed, implementations will need to invent their own rules, which can cause data loss.

Options:

  1. Either. One configuration or the other is present, but which one is unknown.
  2. Mixture. A mixture of both configurations is present in an unknown ratio.
  3. Either (1) or (2).

For its part, the V2000 molfile format uses this interpretation: "It could be either of two stereoisomers, or a mixture of the two." In other words, (3).

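Given the large number of V2000<->SMILES conversions being performed, and since no option has a clear advantage otherwise, (3) makes the most sense: an undefined stereocenter then means the same thing in both formats, so conversions do not silently change its meaning.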
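To make option (3) concrete, here is a minimal Python sketch of how a reader might record it. The names `Configuration`, `StereoReading`, and `read_stereocenter` are hypothetical and not taken from any SMILES toolkit. It assumes the parser has already decided that the atom is a stereocenter and passes the descriptor it found (`"@"`, `"@@"`, or `None` when no descriptor was written).

```python
from dataclasses import dataclass
from enum import Enum
from typing import FrozenSet, Optional


class Configuration(Enum):
    """The two SMILES tetrahedral descriptors."""
    COUNTERCLOCKWISE = "@"
    CLOCKWISE = "@@"


@dataclass(frozen=True)
class StereoReading:
    """What a reader is allowed to conclude about one stereocenter."""
    candidates: FrozenSet[Configuration]  # configurations that may be present
    may_be_mixture: bool                  # True if a mixture cannot be ruled out


def read_stereocenter(descriptor: Optional[str]) -> StereoReading:
    """Interpret a stereocenter's descriptor under option (3).

    Assumption: the caller has already established that the atom is a
    stereocenter; None means no descriptor was written for it.
    """
    if descriptor is None:
        # Undefined configuration: either stereoisomer, or a mixture of both.
        return StereoReading(frozenset(Configuration), may_be_mixture=True)
    # Defined configuration: exactly one candidate, no mixture.
    # Raises ValueError for anything other than "@" or "@@".
    return StereoReading(frozenset({Configuration(descriptor)}), may_be_mixture=False)
```

Keeping the candidate set and the mixture flag separate means a writer can tell "undefined" apart from "defined" without guessing, which is the distinction a V2000 round trip has to preserve.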

See also the discussion here.