InformaticsMatters / pipelines-utils-rdkit

Utilities for RDKit-based pipelines
Apache License 2.0
0 stars 0 forks source link

Missing method? #5

Open simonbray opened 4 years ago

simonbray commented 4 years ago

Hi, the filter.fragment method used here (https://github.com/InformaticsMatters/pipelines/blob/master/src/python/pipelines/rdkit/screen.py#L102) does not seem to exist. Running the script results in an ImportError.

Was it available in a previous version of this library?

Simon

ping @lorrainealisha75

tdudgeon commented 4 years ago

Hi, yes, that method was moved. I've fixed this now: https://github.com/InformaticsMatters/pipelines/commit/d90125b5847ff0238c48a0e259c4b8ce3fc183f1

@alanbchristie could you push the changes to PyPi?

simonbray commented 4 years ago

The conda package which we are using is actually built from the GitHub repository rather than PyPI. (See here: https://github.com/conda-forge/im-pipelines-utils-rdkit-feedstock/blob/c7e555f5ed51b71d7924c551205ce5c7cd7de716/recipe/meta.yaml#L9.) So it would be great if a new release could be made on GitHub in addition to the PyPI release.

alanbchristie commented 4 years ago

Hi,

As far as I can see the fix to the pipelines only impacts the informaticsmatters/pipelines repository with Tim’s commit today which adjusts the screen.py pipeline implementation. There’s nothing new I can see for PyPi so the existing PyPi reference should work.

As I’ve not been involved in this topic before, what needs to be done to "make a new release on GitHub” for the pipelines repo other than provide the commit tag (d90125b5847ff0238c48a0e259c4b8ce3fc183f1)?

Ideally we should be creating tags and releases in the pipelines repo but we haven’t done this yet.


Alan

On 27 Jan 2020, at 12:33 pm, Simon Bray notifications@github.com wrote:

The conda package which we are using is actually built from the GitHub repository rather than PyPI. (See here: https://github.com/conda-forge/im-pipelines-utils-rdkit-feedstock/blob/c7e555f5ed51b71d7924c551205ce5c7cd7de716/recipe/meta.yaml#L9 https://github.com/conda-forge/im-pipelines-utils-rdkit-feedstock/blob/c7e555f5ed51b71d7924c551205ce5c7cd7de716/recipe/meta.yaml#L9.) So it would be great if a new release could be made on GitHub in addition to the PyPI release.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/InformaticsMatters/pipelines-utils-rdkit/issues/5?email_source=notifications&email_token=AHDM5TPK7ZLVYF24VYA3MGDQ73IBZA5CNFSM4KLGNDF2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEJ7KVYA#issuecomment-578726624, or unsubscribe https://github.com/notifications/unsubscribe-auth/AHDM5TNRKVIBVNC4WIUPOK3Q73IBZANCNFSM4KLGNDFQ.

simonbray commented 4 years ago

Ah sorry, you're right, there was no change to this repository. So nothing needs to be updated here, since the latest release is in sync with the latest commit.

alanbchristie commented 4 years ago

OK, so there’s nothing more you need me to do?

I’ll discuss strategies for tagging our pipeline repositories going forward, for now we tend to assume the latest ‘master’ is a workable release.

Alan Christie achristie@informaticsmatters.com

On 27 Jan 2020, at 3:46 pm, Simon Bray notifications@github.com wrote:

Ah sorry, you're right, there was no change to this repository. So nothing needs to be updated here, since the latest release is in sync with the latest commit.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/InformaticsMatters/pipelines-utils-rdkit/issues/5?email_source=notifications&email_token=AHDM5TNAMBAPYX7Y6VY7NEDQ736TVA5CNFSM4KLGNDF2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEJ77EHY#issuecomment-578810399, or unsubscribe https://github.com/notifications/unsubscribe-auth/AHDM5TI4UF3UO26QRBFT47DQ736TVANCNFSM4KLGNDFQ.

simonbray commented 4 years ago

No, I don't think so. At least, not at the moment. It would be nice to have official releases of the pipelines repo though.

simonbray commented 4 years ago

Another question: looking at the one of the tests (here: https://github.com/InformaticsMatters/pipelines/blob/master/test-rdkit.sh#L9)

python src/python/pipelines/rdkit/screen.py
  --qsmiles 'C1N=C(C2=CC=CC=C2)C2=CC=CC=C2C2=C1C=NC(NC1=CC=CC=C1)=N2'
  --simmin 0.45
  -if sdf

Is there a reason that a SMILES string is given, but the input format is sdf? Or is there a more up-to-date version of the tests somewhere else?

tdudgeon commented 4 years ago

@simonbray The query structure is specified as smiles, but the molecules to be screened come from a SD file.

simonbray commented 4 years ago

Makes sense, thanks Tim.