chembl / GLaDOS

Web Interface for ChEMBL @ EMBL-EBI
https://www.ebi.ac.uk/chembl/
Other
44 stars 5 forks source link

Extracting 3D structure for ChEMBL compounds #1314

Open jasperhyp opened 1 year ago

jasperhyp commented 1 year ago

Hi! I am interested in assigning 3D coordinates to (atoms in) some 10K compounds I have originally from ChEMBL. This is because as have been shown by many chemoinformatics papers, 3D and 2D structures together can better represent a molecule. This is also emphasized in large-scale molecular representation learning challenges, where 3D structures are estimated from DFT.

However, it seems time-costly to compute DFT with publicly available libraries (e.g. pyscf), or maybe I am missing something. And the faster approach used by RDKit or OpenBabel (force field-based, e.g. MMFF, UFF) generates less accurate 3D coordinates. I noticed that other databases like DrugBank already provides 3D coords for drugs -- are there any downloadable dumps of 3D structures of all (available) ChEMBL compounds?