MobleyLab / chemper

Repository for Chemical Perception Sampling Tools
MIT License
19 stars 10 forks source link

Add utilities for opening files, including a data directory inside chemper installed with pip, and condensed smirks output #30

Closed bannanc closed 5 years ago

bannanc commented 6 years ago

In this pull request I have a jupyter notebook to show some initial attempts at how to shrink the complex SMIRKS patterns currently created by ClusterGraph.

This led to a few improvements and utility functions in the rest of the code:

Below are the things I'm still working on

bannanc commented 6 years ago

As a side note - I recently noticed that you can now label PRs the same as issues, so I added a WIP label and am using that instead of [WIP] in the title.

codecov-io commented 6 years ago

Codecov Report

Merging #30 into master will decrease coverage by 1.39%. The diff coverage is 91.11%.

bannanc commented 6 years ago

OK, this PR is getting giant, but I'm still not done debugging or adding tests for the new code. Some updated:

bannanc commented 5 years ago

As noted in issue #33 the OE license is expired. I'll make a separate PR to add the new one tomorrow, but both tests pass locally and the RDK tests pass here.

bannanc commented 5 years ago

OK, this PR has snow balled A LOT, but I think it is read to go. Included here are a lot of utility functions and the introduction of the Reducer class. I still have some work to do on this class, but I think we need to get this infrastructure merged. The new notebook Generating Reasonable SMIRKS is the best demonstration of how the Reducer class works currently and also includes documentation of what needs to be explored next.

I'll put in a PR with a new OE license in a few minutes, then this will be ready to go.

bannanc commented 5 years ago

@vtlim is going to help me with at least a little code review before I merge this.

vtlim commented 5 years ago

Comments on the ipynb of Generating reasonable SMIRKS.

Description

Step 1

Step 2

Step 3

Step 4

bannanc commented 5 years ago

Thanks @vtlim

For the record on this PR:

bannanc commented 5 years ago

I merged this, there are TODOs in the code and I have some experiments I want to run with the best "moves" in the SMIRKSifier, but those can happen in smaller steps. I did incorporate all of Vickie's feedback in my notebook.