chainer / chainer-chemistry

Chainer Chemistry: A Library for Deep Learning in Biology and Chemistry
MIT License
629 stars 131 forks source link

About atoms 3D geometry and distance matrix for QM9 dataset #341

Closed xptree closed 5 years ago

xptree commented 5 years ago

I am curious that why chainer-chemistry uses rdkit to generate 3D distance matrix instead of using 3D geometry in QM9 dataset directly. I ask this question because the 3D geometry provided by QM9 dataset is believed to be much more accurate than that by rdkit. Do I miss something in the code?

Code details:

corochann commented 5 years ago

Thank you for your interst & question.

I guess DFT need to be executed to calculate precise distance matrix in QM9 dataset, which means that cost for using/calculating precise distance matrix is almost same with calculating labels(energy etc) in QM 9 dataset. Thus we use "rough" distance matrix calculated by rdkit.

And simply, we could not investigate this issue further yet. https://github.com/pfnet-research/chainer-chemistry/issues/287

xptree commented 5 years ago

@corochann Got it, thanks for your reply.