chrisiacovella closed 3 months ago
The wiki has been updated with many examples and a discussion of the HDF5 file format and the underlying "data" data structure that is written to the HDF5 file.
https://github.com/choderalab/modelforge/wiki/Dataset-and-curation
I still need to implement the test dataset. I had to rerun some calculations.
There appears to be another change in the naming scheme in one of the datasets (Processing SPICE DES370K Single Points Dataset). I need to add a regex search that identifies this different naming convention and, when it is found, skips the sorting by conformer IDs.
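A rough sketch of the kind of check I have in mind is below. The naming pattern (`<name>-<conformer index>`) and the helper `order_records` are placeholders for illustration only, not the actual SPICE record names or existing modelforge code; the real pattern still has to be worked out from the dataset.

```python
import re

# Hypothetical convention: "<molecule name>-<conformer index>", e.g. "mol-3".
# The actual record names in the SPICE datasets may look different.
CONFORMER_SUFFIX = re.compile(r"^(?P<base>.+)-(?P<index>\d+)$")


def order_records(names):
    """Sort record names by (base name, conformer index).

    If any name does not follow the assumed convention, skip the
    sorting entirely and return the names in their original order.
    """
    keyed = []
    for name in names:
        match = CONFORMER_SUFFIX.match(name)
        if match is None:
            # Different naming convention detected: do not sort.
            return list(names)
        keyed.append((match.group("base"), int(match.group("index")), name))
    return [name for _, _, name in sorted(keyed)]
```

Converting the index to an integer before sorting keeps `mol-10` after `mol-2`, which a plain string sort would not.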
Description
This PR adds new datasets to modelforge, including the full SPICE 1.1.4, a preliminary SPICE 2, ANI2x, and the test dataset.
Notes:
Todos
Notable points that this PR has either accomplished or will accomplish.
Status