openmm / spice-dataset

A collection of QM data for training potential functions
MIT License

Create test dataset #98

Closed peastman closed 8 months ago

peastman commented 8 months ago

This script generates a test set for evaluating models trained on SPICE. It tries to measure how well models generalize to new molecules that weren't in the training set, and more specifically how well they generalize to larger molecules than they were trained on.

It includes the following.

There are 10 conformations for each molecule, giving a total of 6000 conformations.
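
The counts above imply 6000 / 10 = 600 molecules in the test set. As a rough sanity check, a sketch like the following could confirm that a generated set has that shape. The function names and the dictionary layout (molecule name mapped to a list of conformations) are hypothetical, chosen for illustration only; they are not taken from the script in this PR or from the SPICE file format.

```python
from typing import Mapping, Sequence

CONFORMATIONS_PER_MOLECULE = 10   # stated in the PR description
TOTAL_CONFORMATIONS = 6000        # stated in the PR description


def implied_molecule_count(total_conformations: int, per_molecule: int) -> int:
    """Return how many molecules a total implies, given a fixed number per molecule."""
    if per_molecule <= 0:
        raise ValueError("per_molecule must be positive")
    if total_conformations % per_molecule != 0:
        raise ValueError("total is not a multiple of the per-molecule count")
    return total_conformations // per_molecule


def check_test_set(conformations_by_molecule: Mapping[str, Sequence[object]]) -> None:
    """Raise ValueError if the (hypothetical) mapping does not match the stated counts."""
    expected = implied_molecule_count(TOTAL_CONFORMATIONS, CONFORMATIONS_PER_MOLECULE)
    if len(conformations_by_molecule) != expected:
        raise ValueError(
            f"expected {expected} molecules, found {len(conformations_by_molecule)}"
        )
    for name, conformations in conformations_by_molecule.items():
        if len(conformations) != CONFORMATIONS_PER_MOLECULE:
            raise ValueError(
                f"{name}: expected {CONFORMATIONS_PER_MOLECULE} conformations, "
                f"found {len(conformations)}"
            )


if __name__ == "__main__":
    # Placeholder data standing in for a real test set.
    fake_set = {f"mol_{i}": [None] * CONFORMATIONS_PER_MOLECULE for i in range(600)}
    check_test_set(fake_set)
    print(implied_molecule_count(TOTAL_CONFORMATIONS, CONFORMATIONS_PER_MOLECULE))
```

With a real dataset, the mapping would be built from whatever structure the generated file actually uses; only the two counts come from the description.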