ModelSEED / ModelSEEDDatabase

This repository contains the definitive copy of the biochemistry and metadata used to construct models using the ModelSEED/ProbAnno approach

Malformed InChI #41

Closed JamesJeffryes closed 5 years ago

JamesJeffryes commented 7 years ago

From @smoretti on October 14, 2016 13:40

We found about 1208 InChI strings in the ModelSEED compound data that cannot be read by the Chemaxon software. E.g.

Copied from original issue: ModelSEED/ModelSEED-UI#60

JamesJeffryes commented 7 years ago

From @smoretti on January 6, 2017 12:48

Also

JamesJeffryes commented 7 years ago

These strings fail to parse because a suffix was appended to the InChI string to prevent certain compounds from being merged. To remedy this, I'll split the suffix into a new column and update the InChIs.
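
As a rough illustration of the suffix split described above, here is a minimal Python sketch. The thread does not say how the suffix is attached, so the `|` delimiter, the function name `split_inchi_suffix`, and the `dup1` suffix value are all assumptions made for the example, not the actual database format.

```python
def split_inchi_suffix(structure, marker="|"):
    """Split a 'structure' value into (inchi, suffix).

    `marker` is a hypothetical delimiter: the real separator used in the
    ModelSEED data is not stated in this thread.
    """
    # partition() splits at the first occurrence of marker; if the marker
    # is absent, the whole value is returned as the InChI and the suffix
    # is an empty string.
    inchi, _sep, suffix = structure.partition(marker)
    return inchi, suffix


# Example with the standard InChI for methane plus a made-up suffix.
inchi, suffix = split_inchi_suffix("InChI=1S/CH4/h1H4|dup1")
print(inchi)
print(suffix)
```

Keeping the suffix in its own column would leave the InChI itself readable by cheminformatics software while still recording the value that keeps compounds from being merged.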

samseaver commented 7 years ago

We can simply retrieve the original InChIs as listed in the Structures directory rather than attempting to re-parse the ones in the 'structure' column.
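
The lookup approach suggested here might be sketched in Python as follows. The layout of the Structures directory is not described in this thread, so the sketch assumes the original InChIs have already been loaded into a dict keyed by compound ID; the `id` column name, the function name `restore_inchis`, and the example IDs and suffix are hypothetical.

```python
def restore_inchis(rows, originals):
    """Return copies of rows with 'structure' replaced by the original InChI.

    rows:      list of dicts with an 'id' key (assumed name) and a
               'structure' key (the column named in this thread).
    originals: dict mapping compound ID to the unmodified InChI, assumed
               to have been read from the Structures directory.
    """
    restored = []
    for row in rows:
        fixed = dict(row)  # copy so the input rows are not mutated
        if row["id"] in originals:
            fixed["structure"] = originals[row["id"]]
        restored.append(fixed)
    return restored


# Hypothetical data: one compound with a suffixed InChI, one with no
# entry in the originals mapping (left unchanged).
rows = [
    {"id": "cpd_a", "structure": "InChI=1S/CH4/h1H4|dup1"},
    {"id": "cpd_b", "structure": "InChI=1S/H2O/h1H2"},
]
originals = {"cpd_a": "InChI=1S/CH4/h1H4"}
fixed_rows = restore_inchis(rows, originals)
```

Replacing values from the source avoids having to guess where the InChI ends and the suffix begins in each string.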

smoretti commented 7 years ago

Great, thanks