rdkit / mmpdb

A package to identify matched molecular pairs and use them to predict property changes.
Other
197 stars 55 forks source link

Turning on --property flag leading to a smaller number of transformed structures #52

Open chengthefang opened 1 year ago

chengthefang commented 1 year ago

Hi all,

I recently found some unexpected outcomes when using "mmpdb transform" with or without the property flag. The mmpdb database was generated using ChEMBL database with calculated LogP as the property.

When I used the "--no-properties" flag, I got 4632 transformed structures. mmpdb transform chembl.mmpdb --smiles "XXXXXX" --min-pairs 5 --min-variable-size 0 --max-variable-size 20 --no-properties -o results_noprop.csv &

However, when I turned on the "--property LogP" flag, I got 591 transformed structures. mmpdb transform chembl.mmpdb --smiles "XXXXXX" --min-pairs 5 --min-variable-size 0 --max-variable-size 20 --property LogP -o results_prop.csv &

I would expect the code with "--property LogP" generates the same number of compounds but with more output info.

Any thoughts on that?

Thanks! Cheng