We should come up with strategies for improving choosing the best preferred identifier for conflated chemicals (something better than the smallest identifier).
Some ideas:
Shorted preferred name or shortest synonym if it’s a direct subset of something else
If cliques have SMILES — choosing the clique with the smallest number of components or the smallest molecular weight would help with
We should come up with strategies for improving choosing the best preferred identifier for conflated chemicals (something better than the smallest identifier).
Some ideas: