chembl / GLaDOS

Web Interface for ChEMBL @ EMBL-EBI
https://www.ebi.ac.uk/chembl/

MySQL download contains unescaped single quotes #1290

Open dmyersturnbull opened 3 years ago

dmyersturnbull commented 3 years ago

Hi,

This pertains to the ChEMBL 28 MySQL download. It appears not to escape single quotes within strings, so it's very difficult to import.

Lines like this one cause the problem (excerpt from the MySQL error message):

''Growth inhibition of human HOP92 cells after 48 hrs by CellTiter-Blue ce' at line 1

Because the ' character is not escaped, running source downloaded_file.sql fails.
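To illustrate the failure mode, here is a minimal sketch of how an unescaped single quote ends a string literal early, and how doubling the quote (the standard SQL convention, which MySQL also accepts) avoids it. This is an assumption-laden illustration, not a fix for the dump itself: it uses Python's built-in sqlite3 as a stand-in for MySQL, and the table name, the helper names, and the sample text are all hypothetical.

```python
import sqlite3


def escape_single_quotes(value: str) -> str:
    """Double every single quote so the value can sit inside a '...' SQL literal."""
    return value.replace("'", "''")


def try_insert(literal_body: str) -> bool:
    """Return True if an INSERT built around the given literal body parses and runs.

    SQLite is used here only as a stand-in parser; MySQL was not tested.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE demo (description TEXT)")  # hypothetical table
        try:
            # Deliberately naive string building, mimicking a dump file line.
            conn.execute(f"INSERT INTO demo VALUES ('{literal_body}')")
        except sqlite3.Error:
            return False
        return True
    finally:
        conn.close()


# Hypothetical description text containing an apostrophe.
raw = "Inhibition of the enzyme's activity"

print(try_insert(raw))                        # literal is cut short at the apostrophe
print(try_insert(escape_single_quotes(raw)))  # doubled quote keeps the literal intact
```

The first call fails with a syntax error because the parser sees the literal end at the apostrophe; the second succeeds because '' is read as a single quote inside the string.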

(Also: The HTTP server seemed to halt the download partway, leaving incomplete files. I tried downloading with curl and wget multiple times but ended up with corrupted tar.gz files that were much too small (often < 1 GB). I was only able to get the files via FTP using /usr/bin/ftp.)
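For anyone hitting the same truncated downloads, a quick way to tell whether a .tar.gz arrived intact is to decompress it end to end and see whether the gzip stream terminates cleanly. The sketch below uses only Python's standard gzip module; the function names are hypothetical, and it checks the gzip layer only, not the tar contents inside.

```python
import gzip
import io
import zlib
from typing import BinaryIO

CHUNK_SIZE = 1024 * 1024  # decompress 1 MiB at a time to keep memory use flat


def stream_is_complete_gzip(stream: BinaryIO) -> bool:
    """Return True if the stream holds a non-empty gzip file that decompresses fully."""
    try:
        with gzip.GzipFile(fileobj=stream, mode="rb") as gz:
            while gz.read(CHUNK_SIZE):
                pass  # discard the data; only the absence of errors matters
    except (EOFError, OSError, zlib.error):
        # EOFError: stream ended before the end-of-stream marker (truncated download)
        # OSError (incl. gzip.BadGzipFile): bad header or CRC/length mismatch
        return False
    # A zero-byte file raises no error above, so reject it explicitly.
    return stream.tell() > 0


def file_is_complete_gzip(path: str) -> bool:
    """Convenience wrapper for a file on disk."""
    with open(path, "rb") as fh:
        return stream_is_complete_gzip(fh)


# Small self-contained demonstration with in-memory data.
payload = gzip.compress(bytes(range(256)) * 1000)
print(stream_is_complete_gzip(io.BytesIO(payload)))                       # intact
print(stream_is_complete_gzip(io.BytesIO(payload[: len(payload) // 2])))  # truncated
```

Calling file_is_complete_gzip on the downloaded archive before attempting the import would catch a partial download early.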

Edit: The CREATE TABLE lines are fine; they are just escaped with ''.