iris-ywang / chemvae_model

Apache License 2.0
0 stars 0 forks source link

Testing #2

Open iris-ywang opened 3 weeks ago

iris-ywang commented 3 weeks ago

So far we have PA-VAE up until 11340 drugs (see #1 ). The models are qsar-tested on 500 test samples.

For SA-VAE, the models from 12600 - 201600 are all qsar-tested on 4500 test samples.

(the results not looking good)

To do:

iris-ywang commented 3 weeks ago

Note on AE-similarity:

  1. X_decoded is often non-integer. For each row (letter), the highest value/prob is taken as 1 and the rest of the values are put as 0.
  2. X_one_hot_decoded might not be able to be translated back to SMILE because they just don't make sense. So I can't use MFP in RDKIT to compare input SMILES and output SMILES for a AE.