samoturk / mol2vec

Mol2vec - an unsupervised machine learning approach to learn vector representations of molecular substructures
BSD 3-Clause "New" or "Revised" License
256 stars 112 forks source link

filter criteria #13

Closed JiangOfCHINA closed 4 years ago

JiangOfCHINA commented 4 years ago

Hi, first, thanks for making this great OSS library, much appreciated.

In the article, it is indicated that only the following elements are allowed to appear in the smiles molecule. Will lowercase letters be included? Some atoms Such as c,o,h,n.... image

I can't download Zinc15. May you provide a way to download it.