Closed caufieldjh closed 2 years ago
The element molimg
in the structures
table is an image, so when the csv parser hits it, it throws (through Koza) a field larger than field limit
error.
Or, this could be due to the double quotes surrounding the molfile
element, but I think it's the first one.
(It was the first one.)
SonarCloud Quality Gate failed.
The comprehensive collection of drug-specific metadata (including all names and availabilty of formulations) is in the
structures
table of the DrugCentral dump, but it's challenging to parse because some it is values are full structure descriptions, complete with newlines. This PR will handle parsing of this table.This PR also includes manual updates to the SSSOM DrugCentral CURIE map, from extracting the structure IDs + names from the PostgreSQL DB as:
The
id
field is what's referred to asstruct_id
elsewhere.